首页 > 解决方案 > 关联图。相关错误:“x”必须是数字。所有字段都是数字

问题描述

目的:

我有一个包含许多贷款处理数据的数据集(数据集:coor_data)。希望找出他们的 p 值相关性度量是否使用cor()a corrplot()。但是,我的错误信息:

cor(coor_data) 中的错误:“x”必须是数字。

以下是已完成的工作:

数据集中的几个属性。sapply()将字符转换为数据集中状态属性的数字数据。选定的列。

df_factored_loans <- df_loans %>% mutate(status = as.character(status), state = factor(state), term = factor(term), grade = factor(grade), employment = factor(employment), length = factor(length), home = factor(home), verified = factor(verified), reason = factor(reason))
df_factored_loans$status <- sapply(as.character(df_factored_loans$status), switch, "Charged Off" = 100, "Fully Paid" = 200, "Current" = 300, "Charged Off" = 400, "In Grace Period" = 500, USE.NAMES = F)
coor_data <- select(df_factored_loans, status, amount, rate, payment, income, debtIncRat, delinq2yr, inq6mth, openAcc, pubRec, revolRatio, totalAcc, totalRevLim, accOpen24, avgBal, bcOpen, bcRatio, totalLim, totalRevBal, totalBcLim, totalIlLim)

这是数据集的示例:coor_data

dim(coor_data) 
[1] 50000    21
length(dimnames(coor_data))
[1] 2
dput(coor_data[1,])
structure(list(status = list(100), amount = 8000, rate = 0.14, 
               payment = 272.07, income = 49966, debtIncRat = 30.05, delinq2yr = 0, 
               inq6mth = 0, openAcc = 11, pubRec = 0, revolRatio = 0.55, 
               totalAcc = 15, totalRevLim = 8100, accOpen24 = 8, avgBal = 4369, 
               bcOpen = 43, bcRatio = 95.7, totalLim = 60629, totalRevBal = 48054, 
               totalBcLim = 1000, totalIlLim = 52529), row.names = 1L, class = "data.frame")
typeof(coor_data[1,])
[1] "list"

执行关联:

corrplot(cor(coor_data), method="number", na.rm = TRUE, type = "full")

cor(coor_data) 中的错误:“x”必须是数字

标签: rplotcorrelation

解决方案


推荐阅读