Code for the Chapter 7 homework and the parts skipped in the lecture

Source: 8-1 Concurrency and Parallelism

翻版郭富城

2019-04-17

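    # Fragment from the topic-detail parser; it needs `from datetime import datetime`
    # and the Answer model to be in scope.
    for answer_item in all_divs[1:]:
        answer = Answer()
        answer.topic_id = topic_id
        # Get the ID of each reply post
        answer_id = answer_item.css("[id^='post-'] ::attr(data-post-id)").extract()[0]
        # Author display name; extracted here but not stored below
        author_name = answer_item.css(".nick_name a::text").extract()[0]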
        author_info = answer_item.css(".nick_name a::attr(href)").extract()[0]
        author_id = author_info.split("/")[-1]
        content = answer_item.css(".post_body.post_body_min_h ::text").extract()[0]
        create_time = answer_item.css(".date_time ::text").extract()[0]
        create_time = datetime.strptime(create_time, "%Y-%m-%d %H:%M:%S")
        praised_num = answer_item.css(".red_praise.digg em::text").extract()[0]

        answer.author = author_id
        answer.content = content.strip()
        answer.create_time = create_time
        answer.praised_nums = int(praised_num)
        answer.id = int(answer_id)

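        # The primary key is set by hand, so update the row if it already exists
        # and force an INSERT otherwise.
        existed_answer = Answer.select().where(Answer.id == answer.id)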
        if existed_answer:
            answer.save()
        else:
            answer.save(force_insert=True)
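
# Second snippet: follower and following counts for the author
all_ems = sel.css(".my_fans_bar li em::text").extract()
follower_nums = all_ems[0]  # number of followers
following_nums = all_ems[1]  # number of users being followed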

author.follower_nums = int(follower_nums)
author.following_nums = int(following_nums)
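
Both snippets above index `extract()[0]` and call `int()` directly, so a selector that matches nothing raises `IndexError`, and a blank count raises `ValueError`. Here is a minimal sketch of guarding those steps, using only the standard library. The helper names (`first_or_default`, `to_int`, `author_id_from_href`, `parse_create_time`) and the example URL are hypothetical and not part of the course code. Note that `author_id_from_href` also strips a trailing slash, which the original `split("/")[-1]` does not.

```python
from datetime import datetime


def first_or_default(values, default=""):
    """Return the first non-blank string from an extract() result, stripped."""
    for value in values:
        if value.strip():
            return value.strip()
    return default


def to_int(text, default=0):
    """Convert a scraped count to int; fall back to `default` on bad input."""
    try:
        return int(text.strip())
    except (ValueError, AttributeError):
        return default


def author_id_from_href(href):
    """Use the last path segment of a profile URL as the author id."""
    return href.rstrip("/").split("/")[-1]


def parse_create_time(text):
    """Parse a timestamp in the same format the snippet above expects."""
    return datetime.strptime(text.strip(), "%Y-%m-%d %H:%M:%S")


if __name__ == "__main__":
    # Hypothetical values standing in for selector results
    print(to_int(first_or_default([" ", " 12 "])))
    print(author_id_from_href("https://example.com/user/abc"))
    print(parse_create_time("2019-04-17 10:30:00"))
```

With helpers like these, a line such as `int(follower_nums)` could become `to_int(first_or_default(all_ems[:1]))`, which yields `0` instead of raising when the element is missing.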
1 answer

bobby

2019-04-18

I don't quite understand what you mean.
