首页 > 解决方案 > 尝试从 Netflix 下载图像时出现请求错误

问题描述

好的,我是 python 新手,我打算学习一点关于网络抓取的知识,这对我来说更难,因为我对 Web、JS、HTML 等一无所知。我的想法是下载一些可用的图像Netflix目录。这甚至适用于前 5 或 6 张图像

import requests, os
from bs4 import BeautifulSoup

url = "https://www.netflix.com/br/browse/genre/839338"
page = requests.get(url)
page.raise_for_status()

soup = BeautifulSoup(page.text)

img_element_list= soup.select('a img')
print(f'Images avalaible (?) : {len(img_element_list)} ')
quantity = int(input('How many images: '))

for c in range(quantity):
    name = img_element_list[c].get('alt')
    print('Downloading ' + name + ' image...')
    img_response= img_element_list[c].get('src')
    print('SCR: ' + img_response + '\n\n')
    
    img = requests.get(img_response)

    file = os.path.join('Images', name)
    img_file = open(file+'.jpg', 'wb')
    for chunk in img.iter_content(100000):
        img_file.write(chunk)
    
    img_file.close()

但最后在第四或第五张图片被下载后,后续图片的 scr 会变成这样 'data:image/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAAAAAAEAAAICTAEAOw==' 然后它会引发这个错误:

Traceback (most recent call last):
  File "C:\Users\1513 IRON\PycharmProjects\DownloadNetflixImg.py", line 20, in <module>
    img = requests.get(img_response)
  File "C:\Users\1513 IRON\AppData\Local\Programs\Python\Python37\lib\site-packages\requests\api.py", line 75, in get
    return request('get', url, params=params, **kwargs)
  File "C:\Users\1513 IRON\AppData\Local\Programs\Python\Python37\lib\site-packages\requests\api.py", line 60, in request
    return session.request(method=method, url=url, **kwargs)
  File "C:\Users\1513 IRON\AppData\Local\Programs\Python\Python37\lib\site-packages\requests\sessions.py", line 533, in request
    resp = self.send(prep, **send_kwargs)
  File "C:\Users\1513 IRON\AppData\Local\Programs\Python\Python37\lib\site-packages\requests\sessions.py", line 640, in send
    adapter = self.get_adapter(url=request.url)
  File "C:\Users\1513 IRON\AppData\Local\Programs\Python\Python37\lib\site-packages\requests\sessions.py", line 731, in get_adapter
    raise InvalidSchema("No connection adapters were found for '%s'" % url)
requests.exceptions.InvalidSchema: No connection adapters were found for 'data:image/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAABAAEAAAICTAEAOw=='

标签: python-3.xweb-scrapingbeautifulsouppython-requests

解决方案


推荐阅读