Is there anything I can do to make my web scraper's output look better?

Question

I'm getting the output I want, but it's all over the place. How can I make it cleaner?

import requests
from bs4 import BeautifulSoup

page = requests.get('https://www.titan.fitness/strength/dumbbells/rubber-coated-hex/rubber-hex-dumbbells-with-cast-iron-handle/HEXDBB-GROUP.html')
soup = BeautifulSoup(page.text, 'html.parser')

dumbbells_availability = soup.find(class_="set-items bundle-items container")
item = dumbbells_availability.find_all(class_="row product-detail set-item")
dumbbell1 = item[0]

availability = dumbbell1.find(class_="availability").get_text()
product_name = dumbbell1.find(class_="product-name").get_text()
print(availability)
print(product_name)

The result looks like this:

[image: screenshot of the output]

Tags: python-3.x, beautifulsoup

Answer


Yes, you can.

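Just call the getText() method (an alias of get_text()) with its strip parameter set to True. This strips the leading and trailing whitespace from each piece of text inside the tag before the pieces are joined.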

For example:

import requests
from bs4 import BeautifulSoup

page = requests.get('https://www.titan.fitness/strength/dumbbells/rubber-coated-hex/rubber-hex-dumbbells-with-cast-iron-handle/HEXDBB-GROUP.html')
soup = BeautifulSoup(page.text, 'html.parser')

dumbbells_availability = soup.find(class_="set-items bundle-items container")
item = dumbbells_availability.find_all(class_="row product-detail set-item")
dumbbell1 = item[0]

availability = dumbbell1.find(class_="availability").getText(strip=True)
product_name = dumbbell1.find(class_="product-name").getText(strip=True)
print(availability)
print(product_name)

Output:

availability:Out of Stock.Notify me when this is in stockSign up
5 LB Rubber Hex Dumbbells
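
Note that in the output above the pieces of text run together ("Out of Stock.Notify me ..."), because strip=True removes the whitespace around each piece and the default separator is an empty string. If that matters, one option is to also pass a separator. The sketch below is only an illustration: the HTML snippet is a made-up stand-in for the real page markup, which is an assumption, not a copy of the site.

```python
from bs4 import BeautifulSoup

# Hypothetical markup, loosely modeled on the availability block;
# the real page's structure may differ.
html = """
<div class="availability">
    <span>availability:</span>
    <span>Out of Stock.</span>
    <a href="#">Notify me when this is in stock</a>
    <button>Sign up</button>
</div>
"""

soup = BeautifulSoup(html, "html.parser")
node = soup.find(class_="availability")

# strip=True alone: each text piece is stripped, then joined with "".
joined = node.get_text(strip=True)

# separator=" " puts a single space between the stripped pieces.
spaced = node.get_text(separator=" ", strip=True)

# stripped_strings yields the same stripped pieces one by one,
# which is handy if you want to keep only some of them.
pieces = list(node.stripped_strings)

print(joined)
print(spaced)
print(pieces)
```

With the pieces available as a list, you could for instance print only pieces[1] to show just the stock status, assuming the markup keeps that order.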
