首页 > 解决方案 > Python - 我尝试用scrapy抓取项目,但是图像链接没有抓取

问题描述

我是 python 和网络抓取的新手,我尝试从这个网站抓取内容,但是当我运行爬虫时我无法获取图像。

这是spider.py:

class TestSpiderSpider(scrapy.Spider):
name = 'testscraper'
page_number = 2
start_urls = ['https://jamaicaclassifiedonline.com/auto/cars/']

def parse(self, response):
    items = scrapeItem()

    product_title = response.css('.jco-card-title::text').extract()
    product_imagelink = response.css('.b-loaded::attr(src)').getall()

    items['product_title'] = product_title
    items['product_imagelink'] = product_imagelink

    yield items

他是 items.py 的代码:

class scrapeItem(scrapy.Item):
product_title = scrapy.Field()
product_imagelink = scrapy.Field()

pass

标签: pythonweb-scrapingscrapy

解决方案


response.css('.card-image img::attr(src)').getall() # images.
response.css('.card-image img::attr(data-src)').getall() # lazy-loaded images.

推荐阅读