首页 > 解决方案 > 当我想使用 R 函数 apply 处理 DocumentTermMatrix 类型的数据集时,出现错误

问题描述

下面是我的 DTM 类型数据集:

查看(sms_dtm_freq_train) sms_dtm_freq_train[["dimnames"]] $ Docs [1]"1""2""3""4""5""6""7""8""9""10""11"" 12” “13” “14”
[15] “15” “16” “17” “18” “19” “20” “21” “22” “23” “24” “25” “26” “27” “28”
[29] “29” “30” “31” “32” “33” “34” “35” “36” “37” “38” “39” “40” “41” “42”
[43] “43” “44” “45” “46” “47” “48” “49” “50” “51” “52” “53” “54” “55” “56”
[57] “57” ""58""59""60""61""62""63""64""65""66""67""68""69""70"
[71]"71""72"" 73” “74” “75” “76” “77” “78” “79” “80” “81” “82” “83” “84”47” “48” “49” “50” “51” “52” “53” “54” “55” “56” [57] “57” “58” “59” “60” “61” “62” “63” “64” “65” “66” “67” “68” “69” “70” [71] “71” “72” “73” “74” “75” “76” “77” “78” ” “79” “80” “81” “82” “83” “84”47” “48” “49” “50” “51” “52” “53” “54” “55” “56” [57] “57” “58” “59” “60” “61” “62” “63” “64” “65” “66” “67” “68” “69” “70” [71] “71” “72” “73” “74” “75” “76” “77” “78” ” “79” “80” “81” “82” “83” “84”57” “58” “59” “60” “61” “62” “63” “64” “65” “66” “67” “68” “69” “70” [71] “71” “72” “73” “74” “75” “76” “77” “78” “79” “80” “81” “82” “83” “84”57” “58” “59” “60” “61” “62” “63” “64” “65” “66” “67” “68” “69” “70” [71] “71” “72” “73” “74” “75” “76” “77” “78” “79” “80” “81” “82” “83” “84”“78” “79” “80” “81” “82” “83” “84”“78” “79” “80” “81” “82” “83” “84”
[85] “85” “86” “87” “88” “89” “90” “91” “92” “93” “94” “95” “96” “97” “98”
[99] “99” ““100”“101”“102”“103”“104”“105”“106”“107”“108”“109”“110”“111”“112”...... [达到 getOption("max.print") -- 省略 4057 个条目]

$Terms [1] "â£wk" "…" "â€"" "abiola" "abl" "abt"
[7] "accept" "access" "account" "across" "activ" "actual"
[13] “add” “address” “admir” “adult” “advanc” “aft”
[19] “afternoon” “aftr” “age” “ago” “ahead” “aight”
[25] “aint” “air ” “aiyah” “alex” “几乎” “alon”
[31] “alreadi”“好吧”“alrit”“也”“alway”“amp”
[37]“angri”“announc”“anoth”“answer”“anybodi”“anymor”
[43] “anyon” “anyth” “anytim” “anyway” “分开” “app”
[49] “appli” “任命” “appreci” “april” “ard” “区域”
[55] “争论” “arm “周围”“安排”“逮捕”“到达”
[61]“尽快”“询问”“询问”“睡觉”“屁股”“尝试”
[67]“拍卖”“可用”“AVE”“避免”“等待”“奖”
[73]“离开”“awesom” “babe” “babi” “back” “bad”
[79] “bag” “bak” “balanc” “bank” “bare” “bath”
[85] “batteri” “bcoz” “bcum” “bday ” “美丽” “成为”“生日”“美丽”“成为”“生日”“美丽”“成为”
[91]“床”“卧室”“开始”“相信”“贝利”“最好”
[97]“更好”“出价”“大”“比尔”“鸟”“生日”
...... ……

[达到 getOption("max.print") -- 省略 1057 个条目]

当我运行时:

sms_train <- apply(sms_dtm_freq_train, MARGIN = 2, convert_counts)

我收到以下错误消息:

apply(sms_dtm_freq_train, MARGIN = 2, convert_counts) 中的错误:dim(X) 必须具有正长度

我输入了与教科书“使用 R 进行机器学习”相同的代码,但我得到了错误。我很困扰。任何人都可以帮我解决这个问题吗?谢谢!

标签: r

解决方案


推荐阅读