首页 > 解决方案 > 初学者网页抓取代码迭代问题

问题描述

我是 Python 新手,非常感谢一些帮助!

我一直在尝试创建一个字典来将书籍分配给他们的作者,只是为了让它变得混乱并不断重复。

我怎样才能解决这个问题?

import requests
from bs4 import BeautifulSoup

url = "https://www.banyen.com/new-arrivals/index.html"
response = requests.get(url)
html = response.content
scraped = BeautifulSoup(html,'html.parser')
results = []

article = scraped.find("div", class_="block block-system block-odd clearfix")
for i in article.find_all():
    name = i.find("h2", "a href", class_="teaser-title")
    author = i.find("span", class_="price-amount")
    if name is not None:
        if author is not None:
          results.append({name:author})

print(results)

标签: pythonweb-scraping

解决方案


import requests
from bs4 import BeautifulSoup
import re

url = "https://www.banyen.com/new-arrivals/index.html"
response = requests.get(url)
html = response.content
scraped = BeautifulSoup(html,'html.parser')
results = []

articles = scraped.find_all("div", id=re.compile("node-"))
for i in articles:
    name = i.find("h2").find('a')
    author = i.find("span", class_="price-amount")
    if name is not None:
        if author is not None:
            results.append({name.text.strip():author.text})

print(results)

推荐阅读