首页 > 解决方案 > 总结日期

问题描述

我想总结周一到周一的日期,但周六和周日累积在周五

## A tibble: 10 x 2
#date        value
#<date>      <dbl>
#  1 2020-01-02  1     --- Thurs
#2 2020-01-03 1        --- Fri
#3 2020-01-04  2       --- Sat
#4 2020-01-06  1       --- Mon
#5 2020-01-07  1       --- Tues
#6 2020-01-08 1        --- Wed
#7 2020-01-09  1       --- Thurs
#8 2020-01-10  1       --- Fri
#9 2020-01-11  2       --- Sat
#10 2020-01-13 1       --- Mon

这就是我要的

#date        value
#<date>      <dbl>
#  1 2020-01-02  1     --- Thurs
#2 2020-01-03 3        --- Fri
#4 2020-01-06  1       --- Mon
#5 2020-01-07  1       --- Tues
#6 2020-01-08 1        --- Wed
#7 2020-01-09  1       --- Thurs
#8 2020-01-10  3       --- Fri
#10 2020-01-13 1       --- Mon

数据

dd = structure(list(date = structure(c(18263, 18264, 18265, 18267, 18268, 18269, 18270, 18271, 18272, 18274), class = "Date"), value = c(1, 1, 2, 1, 1, 1, 1, 1, 2, 1)), row.names = c(NA, -10L), class = c("tbl_df", "tbl", "data.frame"))

标签: rdplyrlubridate

解决方案


我们可以使用周末的'日期' to weekdays,然后使用具有前一个非NA元素的s(假设'日期'已经是d,然后做一个replaceNAfillNAarrangegroup_by sum

library(dplyr)
library(tidyr)
dd %>% 
    mutate(Weekday = weekdays(date),
          date = replace(date, Weekday %in% c("Saturday", "Sunday"), NA)) %>% 
    fill(date) %>% 
    group_by(date) %>% 
    summarise(value = sum(value), .groups = 'drop')

-输出

# A tibble: 8 x 2
#  date       value
#  <date>     <dbl>
#1 2020-01-02     1
#2 2020-01-03     3
#3 2020-01-06     1
#4 2020-01-07     1
#5 2020-01-08     1
#6 2020-01-09     1
#7 2020-01-10     3
#8 2020-01-13     1

推荐阅读