首页 > 解决方案 > 如何使用 BeatifulSoup 的“find_previous_siblings”来查找标签?

问题描述

标签: pythonweb-scrapingbeautifulsoup

解决方案


他们不是兄弟姐妹,因为 strong 在另一个段落标签内,p.

我想你想要find_previous

from bs4 import BeautifulSoup, element
import requests

url = 'http://www.mpsp.mp.br/portal/page/portal/DO_Estado/2020/DO_20-06-2020.html'
page = requests.get(url)
soup = BeautifulSoup(page.content, 'html.parser')

for i in soup.body.contents:
    if isinstance(i, element.NavigableString):
        continue
    if isinstance(i, element.Tag):
        texts = i.text
        if texts == 'HELENA BONILHA DE TOLEDO LEITE':
            print(i.find_previous('h1'))
            print(i.find_previous('strong'))
            print(i)

文档解释find_previousfind_all_previoushttps ://www.crummy.com/software/BeautifulSoup/bs4/doc/#find-all-previous-and-find-previous


推荐阅读