r - 使用多个正则表达式修改 data.table 内容
问题描述
我有一个dt = data.table
带有字符的列。
我需要对该列执行多个正则表达式操作,我将其写为:
dt[, Description := sapply(Description, tolower)][
, Description := sapply(Description, gsub, pattern = " $", replacement = "")][
, Description := sapply(Description, gsub, pattern = " ", replacement = " ")][
, Description := sapply(Description, gsub, pattern = "ões\\>", replacement = "ão")][
, Description := sapply(Description, gsub, pattern = "eis\\>", replacement = "el")][
, Description := sapply(Description, gsub, pattern = "as\\>", replacement = "a")][
, Description := sapply(Description, gsub, pattern = "ais\\>", replacement = "al")][
, Description := sapply(Description, gsub, pattern = "es\\>", replacement = "e")][
, Description := sapply(Description, gsub, pattern = "ns\\>", replacement = "m")][
, Description := sapply(Description, gsub, pattern = "s\\>", replacement = "")]
这些基本上都是在葡萄牙语中将复数变为单数的所有方法。
有没有更有效和优雅的方式来做到这一点?
解决方案
推荐阅读
- r - 整数的下限函数:找到十位的最小值
- r - 计算 R 中 ARIMA 模型残差的 RMSE 和 ACF 图
- css - 段落标签未按预期运行
- docker - 来自守护程序的 Kubernetes GCP 错误响应:未指定命令:CreateContainerError
- sql-server - Pyodbc 调用带参数的存储过程
- java - 将 ArrayList 的最后 3 个元素复制到 List
- python - sqlalchemy 外复合键(声明性)
- android - 转换为字节数组和从字节数组转换后,Android Bitmap 上的颜色不正确
- javascript - Puppeteer 如何检查页面是否已重定向
- java - 无法读取多个序列化对象