python-3.x - 如何使用 Python 3 从网站下载 xlsx 文件?
问题描述
我正在尝试自动下载文件并保存。这应该很容易,但我发现了一些困难。
理论上这里应该很容易,你点击自动下载文件。
我尝试了不同的方法(如在此处或在此处输入链接描述的不同帖子中找到的)。这是我当前代码的几个示例:
选项 A)
url = "https://www.gov.scot/binaries/content/documents/govscot/publications/statistics/2020/04/trends-in-number-of-people-in-hospital-with-confirmed-or-suspected-covid-19/documents/trends-in-number-of-people-in-hospital-with-confirmed-or-suspected-covid-19/trends-in-number-of-people-in-hospital-with-confirmed-or-suspected-covid-19/govscot%3Adocument/HSCA%2B-%2BSG%2BWebsite%2B-%2BIndicator%2BTrends%2Bfor%2Bdaily%2Bdata%2Bpublication.xlsx"
response = requests.get(url,stream=False)
with open(dowload_folder_name, 'wb') as out_file:
shutil.copyfileobj(response.raw, out_file)
选项 B)
xl_df = pd.read_excel(url,
sheet_name='Table 5 - Testing',
skiprows=range(5),
skipfooter=0)
在这两种情况下,我都会得到
urllib.error.URLError: <urlopen error [Errno 60] Operation timed out>
请问有什么建议吗?非常感谢!
解决方案
import requests
def main(url):
r = requests.get(url)
print(r)
with open("data.xlsx", 'wb') as f:
f.write(r.content)
main("https://www.gov.scot/binaries/content/documents/govscot/publications/statistics/2020/04/trends-in-number-of-people-in-hospital-with-confirmed-or-suspected-covid-19/documents/trends-in-number-of-people-in-hospital-with-confirmed-or-suspected-covid-19/trends-in-number-of-people-in-hospital-with-confirmed-or-suspected-covid-19/govscot%3Adocument/HSCA%2B-%2BSG%2BWebsite%2B-%2BIndicator%2BTrends%2Bfor%2Bdaily%2Bdata%2Bpublication.xlsx")
推荐阅读
- reactjs - 如何导航到 App.js 中未声明的路径
- r - R:如何创建一个向量化函数,其中积分的上限是变量
- c - 如何防止二进制文件上的 LD_PRELOAD?
- java - 我如何处理请求中作为“@PathVariable”出现的“ñ”字符,Spring Boot
- javascript - 如何在 HTML 中创建一个多 y 轴图表,其中两个 Y 值都是用户输入?
- python - 在表格中打印文本,留下 n 行和列
- sql - 显示收入累积增加的列,然后仅在某个日期后重复最大值
- python - 浮动字符串列表
- c# - 如何在新选项卡中打开来自 Response MemoryStream 的 PDF
- flutter - 在文本字段上键入时,键盘会阻止颤动的视图