首页 > 解决方案 > 美丽的汤和请求问题它不显示任何文本输出

问题描述

我正在使用美丽的汤并要求打印本网站文章的全文

https://www.vanityfair.com/style/society/2014/06/monica-lewinsky-humiliation-culture

这是我的代码:

import requests
from bs4 import BeautifulSoup

url = requests.get("https://www.vanityfair.com/style/society/2014/06/monica-lewinsky-humiliation-culture")
html = url.text
page = BeautifulSoup(html, 'html.parser')
match = page.find_all('div', 'parbase cn_text')
page_list = [[k.get_text() for k in i.find_all('p')] for i in match]

for i in page_list[:-2]:
   for k in i:
     print(k + '\n')

我的代码运行没有任何错误,但它没有在输出中显示任何文本请帮助我找到我的错误

标签: pythonpython-3.xpython-2.7web-scrapingbeautifulsoup

解决方案


要获取div具有类的文章数据选择article__chunks。属于它的文章数据。

import requests
from bs4 import BeautifulSoup

url = requests.get("https://www.vanityfair.com/style/society/2014/06/monica-lewinsky-humiliation-culture")
html = url.text
page = BeautifulSoup(html, 'html.parser')
match = page.find('div', {'class': 'article__chunks'})

page_list = [[k.get_text() for k in i.find_all('p')] for i in match]

for i in page_list[:-2]:
   for k in i:
     print(k + '\n')

推荐阅读