r - 在另一个变量中分组的一个变量的唯一值的频率 - R?
问题描述
极端新手问题:我有2个变量,地区ID和家庭ID,地区内有重复的家庭。我只是想找出每个地区有多少独特的家庭。
这就是我正在尝试的:
library(dplyr)
table <- data %>% group_by(region) %>% summarise(hid = unique(hid))
错误消息:错误:列
hid
的长度必须为 1(汇总值),而不是 142
解决方案
这样的事情可能会让你得到你想要的:
library(tidyverse)
df <- tibble(region_id = c(1, 2, 3, 1, 2, 3),
household_id = c("a", "b", "b", "a", "a", "b"))
df %>%
group_by(region_id) %>%
count(household_id) %>%
summarize(unique_households = n())
推荐阅读
- javascript - Why highcharts replaces symbol 9 by -
- javascript - 如何打印reactjs中每个项目的所有评论列表的数量?
- javascript - 有没有办法用我自己的域名而不是 Gmail 邮件合并
- time-series - Multivariate Time series forecasting model using XGBoost and LSTMs for different time-series
- wordpress - What's the hook for modifying the html code added by Wordpress via Add Media Insert from URL?
- python - Getting different y-axis range using plot/scatter on matplotlib
- node.js - "npm ci" command causing longer build time on gitlab than the "npm i"
- python - 如何将自定义对象序列化/编码为字节/字节数组?
- c++ - 在 CIMG 中附加图像并保存在 stb_image_write 中不起作用
- android - Android non-square hit areas. Can it be done?