首页 > 解决方案 > R查找按访问ID细分的总计数

问题描述

在 R 中,我有一个数据框:

df <- read_delim("Visit_ID | Visit_Count | Cluster  
A  | 2 | orange    
A  | 2 | green
B  | 2 | green  
B  | 2 | green    
C  | 3 | orange    
C  | 3 | orange    
C  | 3 | green    
D  | 3 | orange  
D  | 3 | green    
D  | 3 | orange", delim = "|", trim_ws = TRUE)

我想如何按访问频率细分每个集群的 Visit_ID 计数。生成的数据框应如下所示:

df_result <- read_delim("Cluster | VisitID_Frequency | Total_count  
Orange  | 1 | 1    
Orange  | 2 | 2
Orange  | 3 | 0
Green   | 1 | 3
Green   | 2 | 1
Green   | 3 | 0
", delim = "|", trim_ws = TRUE)

标签: rdplyr

解决方案


df %>% group_by(Visit_ID, Cluster) %>% 
                summarise(visit_count = n()) %>% 
                  arrange(Cluster) %>% 
                    group_by(Cluster, visit_count) %>% 
                      count()

推荐阅读