python - 简单的 python 网页抓取与登录
问题描述
我似乎无法连接到网站来抓取它。我想知道我是否做错了什么?我在另一个网站上尝试了相同的代码并且它可以工作,但我没有收到任何身份验证错误......我将不胜感激任何帮助,谢谢!
import requests
from bs4 import BeautifulSoup
login_url = 'https://secure.selfwealth.com.au/Account/Login?ReturnUrl=%2F'
session = requests.Session()
page = session.get(login_url)
soup = BeautifulSoup(page.text, 'html.parser')
token = soup.find(attrs={"name": '__aft'}).get('value')
payload = {
"email": "bob@gmail.com",
"password": "mypwd",
"__aft": token
}
result = session.post(login_url, data=payload)
page = session.get(login_url)
#print(result.ok)
soup = BeautifulSoup(page.content, 'html.parser')
results = soup.find('div', class_='daily-perf')
results2 = soup.find("title")
#marketvalue = results.find('div', class_='price-change')
print(results)
print(results2)
print(payload)
解决方案
import requests
from bs4 import BeautifulSoup
data = {
"Email": "your@email.com",
"Password": "yrpass"
}
headers = {}
def main(url):
with requests.Session() as req:
r = req.get("https://secure.selfwealth.com.au/Account/Login")
soup = BeautifulSoup(r.content, 'html.parser')
headers["X-XSRF-TOKEN"] = soup.find("input",
{"name": "__aft"}).get("value")
r = req.post(url, json=data, headers=headers).json()
print(r)
main("https://secure.selfwealth.com.au/api/login")
推荐阅读
- python - 如何根据 python 中的用户输入打印列表中的某些项目?
- node.js - Nodejs自动登录如果存在cookie记住我
- sql - MS Access SQL 在行数据和列名上复制 Excel vlookup
- java - 活动开始时在底部强制滚动视图
- reactjs - 源代码管理中的 Visual Studio 代码片段
- java - How do you use your interface's private methods?
- javascript - 如何向 Mongoose Schema 添加新密钥?
- python - Python pyautogui.click 是右键单击而不是左键单击
- ios - Swift 4 图像选择器不改变 UIImageView
- javascript - Jquery效果函数调用在javascript中不起作用