首页 > 解决方案 > 我正在尝试用噩梦制作一个网络抓取项目

问题描述

我正在尝试用噩梦制作一个网络搜索项目我正在抓取的网站有很多像孩子(1),孩子(2)这样的孩子,所以我想刮掉这个子菜单中的所有细节,但由于很多像这样的东西

        <div class="child(1)">
            <h1 class="name">1</h1>
            <p class="discription">Lorem ipsum dolor sit amet consectetur adipisicing elit. Unde in minus tempore quod
                cumque cupiditate ipsam nostrum iste nihil quas! Repudiandae quos obcaecati eligendi, nostrum dolorum
                delectus accusamus at totam!</p>
        </div>
        <div class="child(2)">
            <h1 class="name">2</h1>
            <p class="discription">Lorem ipsum dolor sit amet consectetur adipisicing elit. Unde in minus tempore quod
                cumque cupiditate ipsam nostrum iste nihil quas! Repudiandae quos obcaecati eligendi, nostrum dolorum
                delectus accusamus at totam!</p>
        </div>
        <div class="child(3)">
            <h1 class="name">3</h1>
            <p class="discription">Lorem ipsum dolor sit amet consectetur adipisicing elit. Unde in minus tempore quod
                cumque cupiditate ipsam nostrum iste nihil quas! Repudiandae quos obcaecati eligendi, nostrum dolorum
                delectus accusamus at totam!</p>
        </div>
        <div class="child(4)">
            <h1 class="name">4</h1>
            <p class="discription">Lorem ipsum dolor sit amet consectetur adipisicing elit. Unde in minus tempore quod
                cumque cupiditate ipsam nostrum iste nihil quas! Repudiandae quos obcaecati eligendi, nostrum dolorum
                delectus accusamus at totam!</p>
        </div>
        <div class="child(5)">
            <h1 class="name">5</h1>
            <p class="discription">Lorem ipsum dolor sit amet consectetur adipisicing elit. Unde in minus tempore quod
                cumque cupiditate ipsam nostrum iste nihil quas! Repudiandae quos obcaecati eligendi, nostrum dolorum
                delectus accusamus at totam!</p>
        </div>
它有像上面这样的结构,所以我需要把所有的东西都放在所有孩子里面,像 h1 和 p 一样

[
      {
      "child name":"child(1)"
      "hi":"1"
      "p":"Lorem ipsum dolor sit amet consectetur adipisicing elit. Unde in minus tempore quod
                    cumque cupiditate ipsam nostrum iste nihil quas! Repudiandae quos obcaecati eligendi, nostrum dolorum
                    delectus accusamus at totam!"
      }
{
      "child name":"child(2)"
      "hi":"2"
      "p":"Lorem ipsum dolor sit amet consectetur adipisicing elit. Unde in minus tempore quod
                    cumque cupiditate ipsam nostrum iste nihil quas! Repudiandae quos obcaecati eligendi, nostrum dolorum
                    delectus accusamus at totam!"
      }
    ]

标签: javascripthtmlcssnode.jsnightmare

解决方案


推荐阅读