python - 使用关键字搜索 HTML TD
问题描述
我正在尝试在 HTML TD 中查找值。我已经阅读了很多并尝试了变化,但它没有点击。
这是我的 HTML 文档
<html>
<head>
<meta http-equiv="content-language" content="en-us">
<meta http-equiv="Content-Type" content="text/html; charset=windows-1252">
</head>
<body style="font-family: Arial; font-size: 10pt; background-color:#eff7e8;" link="#3333ff" vlink="#3333ff" alink="#3333ff" topmargin="16" leftmargin="20">
<font color="#1984a8" style="font-size: 14pt">Property Details</font>
<table border="0" cellpadding="0" cellspacing="0" style="border-collapse: collapse; padding: 0; " bordercolor="#111111" width="1000px" height="60px" id="AutoNumber1" bordercolorlight="#693A4A" bordercolordark="#693A4A" height="1">
<tr>
<td height="1" style="padding: 2; " valign="bottom">
<font color=#000000 style="font-size: 12pt">Number of records found: </font><b><font style="font-size: 12pt; color: #FF0000">1</b></font>
</td>
</tr>
<tr>
<td height="1" style="padding: 2; " valign="top" width="100%" colspan="2">
<font align="justify" face="Arial" style="font-size: 10pt"><p align="justify">
One record is displayed for each address found at the selected property. Multiple addresses may occur in the case of condominiums.<br><br>
</p></font></td>
</tr>
</table>
<TABLE BORDER=0 CELLSPACING=1 style="font-family: Arial; font-size: 10pt"><tr><TD align=center width=412 bgcolor=#ffffff style='border: 1px solid #ededed'><font style='font-size: 11pt'><b>1250 TRUMBULL HWY</b> </font><br><font style='font-size: 10pt'>ID: <b><font style='font-size: 10pt'>250-10</b> </font><font style='font-size: 10pt'>Book: <b><font style='font-size: 10pt'>0311</b></font><font style='font-size: 10pt'> Page: <b><font style='font-size: 10pt'>1063</b></font></td><TD width=412 align='center'><a target='_blank' href='recordcard.asp?TY=0&ID=2732&PID=250-10'> <b>Property Card</b> </a><br></td></tr><tr><TD width=412 bgcolor=#ededed>ALTERNATE ID: <b>250-10</b></TD>
<TD width=412 bgcolor=#ededed>ACCOUNT NUMBER: <b>R0141550</b></TD></tr><tr>
<TD width=412 bgcolor=#ffffff>PARCEL ID: <b>250-10</b></TD>
<TD width=412 bgcolor=#ffffff>LOCATION: <b>1250 TRUMBULL HWY</b></TD></tr><tr>
<TD width=412 bgcolor=#ededed>BUILDING ASSESSMENT: <b>70120</b></TD>
<TD width=412 bgcolor=#ededed>IMPROVEMENTS ASSESSMENT: <b>71980</b></TD></tr><tr>
<TD width=412 bgcolor=#ffffff>LAND ASSESSMENT: <b>19140</b></TD>
<TD width=412 bgcolor=#ffffff>OTHER BUILDING ASSESSMENT: <b>880</b></TD></tr><tr>
<TD width=412 bgcolor=#ededed>EXTRA FEATURES ASSESSMENT: <b>980</b></TD>
<TD width=412 bgcolor=#ededed>TOTAL ASSESSMENT: <b>91120</b></TD></tr><tr>
<TD width=412 bgcolor=#ffffff>PRIOR TOTAL ASSESSMENT: <b>91120</b></TD>
<TD width=412 bgcolor=#ffffff>LAST SALE DATE: <b>2019-05-03 00:00:00.0000000</b></TD></tr><tr>
<TD width=412 bgcolor=#ededed>BOOK: <b>0311</b></TD>
<TD width=412 bgcolor=#ededed>PAGE: <b>1063</b></TD></tr><tr>
<TD width=412 bgcolor=#ffffff>LAST SALE VALUE: <b>95000</b></TD>
<TD width=412 bgcolor=#ffffff>OWNER: <b>SEIDL DAVID D</b></TD></tr><tr>
<TD width=412 bgcolor=#ededed>CO-OWNER: <b></b></TD>
<TD width=412 bgcolor=#ededed>MAILING ADDRESS LINE 1: <b>82 BASCOM RD</b></TD></tr><tr>
<TD width=412 bgcolor=#ffffff>MAILING ADDRESS CITY: <b>LEBANON</b></TD>
<TD width=412 bgcolor=#ffffff>MAILING ADDRESS STATE: <b>CT</b></TD></tr><tr>
<TD width=412 bgcolor=#ededed>MAILING ADDRESS ZIP: <b>06249</b></TD>
<TD width=412 bgcolor=#ededed>MAILING ADDRESS LINE 2: <b></b></TD></tr><tr>
<TD width=412 bgcolor=#ffffff>MAILING ADDRESS LINE 3: <b></b></TD>
<TD width=412> </td></tr><tr><TD width=412> </td><TD width=412> </td></tr><tr><TD width=412> </td><TD width=412> </td></tr><tr><TD width=412> </td><TD width=412> </td></tr><tr></tr><tr></TABLE>
<SCRIPT LANGUAGE='JavaScript'>
if (document.body.scrollHeight < 800) {parent.document.getElementById('frame_b').height = 800} else {parent.document.getElementById('frame_b').height = document.body.scrollHeight;}</SCRIPT>
</body>
</html>
我正在寻找提取包裹信息
<TD width=412 bgcolor=#ededed>BOOK: <b>0311</b></TD>
根据我在网上看到的人和各种迭代,这是我想出的,但我仍然得到一个空白列表。
import requests
from bs4 import BeautifulSoup
page = requests.get('http://www.mainstreetmaps.com/ct/lebanon/property.asp?TY=0&PID=250-10')
soup = BeautifulSoup(page.content, 'html.parser')
a = soup.findAll('td', text = "BOOK")
我使用的网站在这里:http ://www.mainstreetmaps.com/ct/lebanon/property.asp?TY=0&PID=250-10
解决方案
尝试类似:
for book in soup.select('td'):
if 'BOOK' in book.text:
print(book.select_one('b').text)
输出:
0311
推荐阅读
- python - 安装 Jupyter 时出错:命令出错,退出状态为 1:python setup.py egg_info 检查日志以获取完整的命令输出
- flutter - 发生异常。FormatException (FormatException: 意外字符 (在字符 1) ^ )
- mysql - 插入记录并增加其他记录的值
- php - PHP - 如何获取没有值的 $_GET 参数
- html - 让 MatDatePicker 显示为格式 01/01/2010
- javascript - 如何禁用弹出窗口阻止程序?
- firebase - 使用 Firebase Cloud Functions 为 Live App React Naive 定价的 Stripe Payment
- java - 如何在recyclerview中添加多个过滤器?
- python - Scrapy中的报纸文章刮刀
- axios - 如何在 Axios 中使用“preconnect”和“dns-prefetch”