Adding and subtracting values from columns in a data frame

Problem description

I have a data frame like this:

+-----+---------+-----+-----+-----------------+----------+----------+------------+
| ID  | CURRENT | JAN | FEB | CURRENT_IN_2018 | JAN_2018 | FEB_2018 | UNITS_SWAP |
+-----+---------+-----+-----+-----------------+----------+----------+------------+
| 123 |       2 |   3 |   4 |               5 |        6 |        7 |         12 |
| 456 |       1 |   5 |   0 |               4 |        8 |        6 |          6 |
+-----+---------+-----+-----+-----------------+----------+----------+------------+

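What I am trying to do is subtract the number in UNITS_SWAP from CURRENT_IN_2018, then from JAN_2018, then from FEB_2018, in that order, until UNITS_SWAP reaches zero. At the same time, the amount subtracted from each of those columns should be added to its matching column. For example, if 5 units are subtracted from CURRENT_IN_2018, then 5 units are added to CURRENT, and likewise for JAN and FEB. The final result would look like this: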

+-----+---------+-----+-----+-----------------+----------+----------+------------+
| ID  | CURRENT | JAN | FEB | CURRENT_IN_2018 | JAN_2018 | FEB_2018 | UNITS_SWAP |
+-----+---------+-----+-----+-----------------+----------+----------+------------+
| 123 |       7 |   9 |   5 |               0 |        0 |        6 |          0 |
| 456 |       5 |   7 |   0 |               0 |        6 |        6 |          0 |
+-----+---------+-----+-----+-----------------+----------+----------+------------+
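
To make the arithmetic concrete, the following sketch traces the row for ID 123 by hand. The variable names (`swap`, `avail`, `base`, `taken`) are illustrative only and do not appear in the original data.

```r
# Values for ID 123, copied from the tables above
swap  <- 12                                  # UNITS_SWAP
avail <- c(CURRENT = 5, JAN = 6, FEB = 7)    # the three 2018 columns
base  <- c(CURRENT = 2, JAN = 3, FEB = 4)    # CURRENT, JAN, FEB

# Units still to place before each column is visited: 12, 7, 1
remaining_before <- swap - c(0, head(unname(cumsum(avail)), -1))

# Take what is available, but never more than what remains (and never below 0)
taken <- pmin(avail, pmax(remaining_before, 0))

base + taken        # new CURRENT, JAN, FEB: 7 9 5
avail - taken       # new 2018 columns:      0 0 6
swap - sum(taken)   # UNITS_SWAP left over:  0
```

These values match the first row of the expected result.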

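Script to load the data (note that the column shown as CURRENT_IN_2018 in the tables is named CURRENT_2018 here):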

DF <- data.frame(ID = c(123, 456),
                 CURRENT = c(2, 1),
                 JAN = c(3, 5),
                 FEB = c(4, 0),
                 CURRENT_2018 = c(5, 4),
                 JAN_2018 = c(6, 8),
                 FEB_2018 = c(7, 6),
                 UNITS_SWAP = c(12, 6))

Tags: r, aggregate, plyr, dplyr

Solution


You can do it like this, though note that it overwrites your source DF:

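cols <- c('CURRENT', 'JAN', 'FEB')
for (i in seq_len(nrow(DF))) {
  # One pass over the columns is enough. A while loop here would never end
  # if UNITS_SWAP exceeded the total held in the _2018 columns.
  for (col in cols) {
    # Stop early once this row has nothing left to swap
    if (DF[i, 'UNITS_SWAP'] <= 0) break
    col_2018 <- paste0(col, '_2018')
    # Move as much as possible, capped by what the _2018 column holds
    excess <- min(DF[i, 'UNITS_SWAP'], DF[i, col_2018])
    DF[i, col] <- DF[i, col] + excess
    DF[i, col_2018] <- DF[i, col_2018] - excess
    DF[i, 'UNITS_SWAP'] <- DF[i, 'UNITS_SWAP'] - excess
  }
}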

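Because your columns follow a clear naming pattern (each column has a matching column_2018 counterpart), we only need to loop over them in the order you specified and append _2018 to get the matching column.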
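
If overwriting DF is a concern, one option is to wrap the same idea in a function that works on its own copy and returns the result. The sketch below is a minimal version under the assumption that the columns are named as in the loading script. The function name `swap_units` and its `suffix` argument are hypothetical and not part of the original answer. It also swaps all rows at once with `pmin()` instead of looping row by row.

```r
# Hypothetical helper: returns a modified copy and leaves the input untouched
swap_units <- function(df, cols = c("CURRENT", "JAN", "FEB"), suffix = "_2018") {
  for (col in cols) {
    src <- paste0(col, suffix)
    # Per-row amount to move, limited by both the remaining swap and the source
    moved <- pmin(df$UNITS_SWAP, df[[src]])
    df[[col]] <- df[[col]] + moved
    df[[src]] <- df[[src]] - moved
    df$UNITS_SWAP <- df$UNITS_SWAP - moved
  }
  df
}

DF <- data.frame(ID = c(123, 456),
                 CURRENT = c(2, 1),
                 JAN = c(3, 5),
                 FEB = c(4, 0),
                 CURRENT_2018 = c(5, 4),
                 JAN_2018 = c(6, 8),
                 FEB_2018 = c(7, 6),
                 UNITS_SWAP = c(12, 6))

result <- swap_units(DF)   # DF itself keeps its original values
```

Because R copies a function argument when it is modified, the assignments inside `swap_units` affect only the local `df`. If UNITS_SWAP is larger than the total available in the 2018 columns, the remainder simply stays in UNITS_SWAP.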
