首页 > 解决方案 > 有没有办法使用 Reddit ExtractoR 只查看帖子标题并排除评论?

问题描述

标题:我希望使用 Reddit ExtractoR 从 2021 年 1 月 1 日以来的特定 subreddit 中提取所有帖子标题。我不知道如何引导该功能只查看帖子标题而不添加任何评论。我在下面发布了我的代码的当前版本:

reddit_test <- get_reddit(search_terms = NA, regex_filter = "", subreddit = "subreddit_name",
           cn_threshold = 1, page_threshold = 12000, sort_by = "new",
           wait_time = 2)

标签: rreddit

解决方案


get_reddit()函数似乎总是返回注释。

看github https://github.com/ivan-rivera/RedditExtractoR

reddit_urls()函数将返回更少的属性,标题是其中之一:

> reddit_links <- reddit_urls(   search_terms   = "cute_cats", page_threshold = 1 )
> 
> 
> str(reddit_links) 'data.frame':   25 obs. of  5 variables:  
> $ date      : chr  "05-02-15" "24-02-14" "03-09-13" "20-05-14" ...  
> $ num_comments: num  214 26 221 36 44 41 93 199 20 175 ...  
> $ title     : chr  "My brother's cat is insanely cute!" "...  
> $ subreddit  : chr  "cats" "cats" "cats" "cats" ...  
> $ URL         : chr "http://www.reddit.com/r/cats/comments/2uv9q5/my_brothers_cat_is_insanely_cute/?ref=search_posts" ...

推荐阅读