首页 > 解决方案 > 在 r 中按组完成时间序列

问题描述

我有一个数据框

dat <- data.frame(c("G", "G", "G", "G"), c("G1", "G1", "G2", "G2"), c('2017-01-01', '2017-01-03', '2017-04-02', '2017-04-05'))

colnames(dat) <- c('Country', 'Place', 'date')

我想要这个输出:(每个(国家/地区)组的完整日期)

dat <- data.frame(c("G", "G", "G", "G", "G", "G", "G"),
                  c("G1","G1", "G1", "G2", "G2", "G2", "G2"), 
                  c('2017-01-01', '2017-01-03','2017-01-03', 
                    '2017-04-02', '2017-04-03', '2017-04-04', '2017-04-05'))

我努力了:

dat = dat %>% group_by(Country, Place) %>% complete(date)

但它不起作用。谁能帮我这个?

标签: rdplyrtime-series

解决方案


你可以做:

dat %>%
  mutate(date = as.Date(date)) %>%
  group_by(Country, Place) %>%
  complete(date = seq.Date(min(date), max(date) , by= "day"))


# A tibble: 7 x 3
# Groups:   Country, Place [2]
  Country Place date      
  <fct>   <fct> <date>    
1 G       G1    2017-01-01
2 G       G1    2017-01-02
3 G       G1    2017-01-03
4 G       G2    2017-04-02
5 G       G2    2017-04-03
6 G       G2    2017-04-04
7 G       G2    2017-04-05

推荐阅读