cnblogs scrapy shell response 443

Source: 4-12 Extracting detail page information

朗月清风

2021-09-16

[s] Available Scrapy objects:
[s] scrapy scrapy module (contains scrapy.Request, scrapy.Selector, etc)
[s] crawler <scrapy.crawler.Crawler object at 0x10f2c48b0>
[s] item {}
[s] request <GET https://news.cnblogs.com/n/702424/>
[s] response <200 https://account.cnblogs.com:443/signin?ReturnUrl=https%3A%2F%2Fnews.cnblogs.com%2Fn%2F702424%2F>
[s] settings <scrapy.settings.Settings object at 0x10f2c4a60>
[s] spider <DefaultSpider 'default' at 0x10f646190>
[s] Useful shortcuts:
[s] fetch(url[, redirect=True]) Fetch URL and update local objects (by default, redirects are followed)
[s] fetch(req) Fetch a scrapy.Request and update local objects
[s] shelp() Shell help (print this help)
[s] view(response) View response in a browser

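The request for https://news.cnblogs.com/n/702424/ comes back as a response from the account.cnblogs.com:443 sign-in page instead of the news page. Has anyone run into a similar problem? How did you solve it? Thanks 🙏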


3 answers

bobby

2021-09-17

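The page can only be accessed after logging in, so the request is redirected to the sign-in page. Start with simulated login: work through the Zhihu simulated-login lesson first, then come back to this one.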
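One way to confirm that a response is the sign-in redirect rather than the page that was requested is to inspect `response.url`. The following is only a minimal sketch: the helper name `login_redirect_target` is hypothetical, and the host and `ReturnUrl` parameter are taken from the shell output in the question, not from any documented cnblogs behavior.

```python
from urllib.parse import urlparse, parse_qs


def login_redirect_target(response_url):
    """Return the originally requested URL if response_url is the cnblogs
    sign-in page seen in the question, otherwise None.

    Hypothetical helper: the host and the ReturnUrl parameter are assumptions
    based on the shell output above. A sign-in URL without a ReturnUrl
    parameter also yields None.
    """
    parts = urlparse(response_url)
    if parts.hostname != "account.cnblogs.com":
        return None
    if not parts.path.startswith("/signin"):
        return None
    values = parse_qs(parts.query).get("ReturnUrl")
    return values[0] if values else None


redirected = (
    "https://account.cnblogs.com:443/signin"
    "?ReturnUrl=https%3A%2F%2Fnews.cnblogs.com%2Fn%2F702424%2F"
)
print(login_redirect_target(redirected))
```

In the shell, calling `login_redirect_target(response.url)` and getting a URL back would suggest that the request was sent without a valid login session.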

摩卡星冰乐
I am already logged in, but I still get this.
2022-03-27
1 reply in total

bobby

2022-03-31

Did the simulated login actually succeed? Have you set COOKIES_ENABLED in settings?
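As a rough illustration of the cookie check, the sketch below shows the two relevant Scrapy settings and one way to hand browser cookies to a request. `COOKIES_ENABLED` and `COOKIES_DEBUG` are real Scrapy settings; the helper `cookie_header_to_dict` and the cookie names and values are hypothetical placeholders, not the cookies cnblogs actually issues.

```python
# settings.py fragment: both names are real Scrapy settings.
COOKIES_ENABLED = True  # the default; make sure it was not set to False
COOKIES_DEBUG = True    # log the Cookie / Set-Cookie headers of each request


def cookie_header_to_dict(header):
    """Turn 'a=1; b=2' (a Cookie header copied from the browser) into a dict.

    Hypothetical helper, not part of Scrapy.
    """
    cookies = {}
    for part in header.split(";"):
        name, sep, value = part.strip().partition("=")
        if sep and name:
            cookies[name] = value
    return cookies


# Placeholder cookie names and values, not the ones cnblogs actually uses.
cookies = cookie_header_to_dict("session_id=abc123; theme=dark")
print(cookies)

# Inside `scrapy shell`, the dict could then be sent with the request:
#   req = scrapy.Request("https://news.cnblogs.com/n/702424/", cookies=cookies)
#   fetch(req)
```

If the response is still the sign-in page with valid cookies attached, turning on `COOKIES_DEBUG` should at least show in the log whether the cookies are being sent at all.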


朗月清风

Asker

2021-09-17

OK, thank you.
