首页 > 解决方案 > 具有 2 个分组变量且 mat=TRUE 的 `describeBy` 会产生错误“下标越界”

问题描述

在摘要统计中出现"subscript out of bounds"错误。describeBy当按两个变量分组并且mat=TRUE数据集> 1000 obs时。如果 mat=TRUE 被删除然后它是一个列表输出,它适用于大型数据集。使用心理版本 1.8.12。

Error in [<-(tmp, var, group + 1, value = dim.names[[group]][[groupi]]) : 
subscript out of bounds error

任何帮助表示赞赏

“数据”有 120951 个观察值,6 个变量。

prod_metrics <- describeBy(list(data$TOTALCHARGES, data$NONCOVEREDCHARGES,
                            data$COVEREDCHARGES, data$HCE_TARGET_AMT),
                     list(data$PARENTCLIENTCODE, data$RECEIVEDDATE), mat = TRUE, digits = 2)

下面是数据结构:

dput(head(data))
structure(list(TOTALCHARGES = c(216, 496.68, 150, 610, 6259.73, 1020.97), PARENTCLIENTCODE = structure(c(468L, 253L, 456L, 456L, 43L, 167L), .Label = c("5STAR-P", "AAE-P", "AALT-P", "ABA-P", 
"ABN-P", "ACN-P","XIND-P","XKAI-P","XOXF-P","XPACI-P","XPIC-P","XRX-P","XSFH-P","XSISTERP"), class = "factor"), RECEIVEDDATE = structure(c(1L, 
1L, 1L, 1L, 1L, 1L), .Label = c("2019-02-14","2019-02-15","2019-02-16","2019-02-17","2019-02-18","2019-02-19","2019-02-20","2019-02-21", 
"2019-04-03","2019-04-04","2019-04-05"), class = "factor"), NONCOVEREDCHARGES = c(0, 0, 0, 0, 0, 0), COVEREDCHARGES = c(216, 
496.68, 150, 610, 6259.73, 1020.97), HCE_TARGET_AMT = c(216,496.68, 150, 610, 6259.73, 1020.97)),.Names = c("TOTALCHARGES", 
"PARENTCLIENTCODE", "RECEIVEDDATE", "NONCOVEREDCHARGES", "COVEREDCHARGES", 
"HCE_TARGET_AMT"), row.names = c(NA, 6L), class = "data.frame")

标签: rpsych

解决方案


推荐阅读