首页 > 解决方案 > 来自csv文件的beautifulsoup多个关键字

问题描述

我有一个包含 2 列 A 和 B 的 csv 文件,我想用 beautifulsoup 报废所有文件

url 的组成如下:http://.../search?info=A&who=B
如何创建循环?

我的代码

from bs4 import BeautifulSoup
import requests
import json
import csv

with open('input.csv') as csvfile:
    reader = csv.reader(csvfile) 
    for row in reader:
        url = ".../search?info={}&who={}".format(row[0], row[1])
        response = requests.get(url)
        html = response.content
        soup = BeautifulSoup(html, "html5lib")

        for p in soup.find_all(class_="crd"):
            b = p.find(class_="info")
            if b['data-info'] is not None:
            j = json.loads(b['data-info'])
            data= p.h2.a.string

标签: pythonweb-scrapingbeautifulsoup

解决方案


import csv
with open('input.csv') as csvfile:
  reader = csv.reader(csvfile) 
  for row in reader:
    url = url = ".../search?info={}&who={}".format(row[0], row[1])
    #rest of your logic

推荐阅读