python - 熊猫删除括号之间的字符
问题描述
我想删除之间的字符[]
,目前我正在做
df['Text'] = df['Text'].str.replace(r"\[.*\]","")
但是输出是不可取的。在它之前[image] This document
和之后******* This document
是*
空格。
我如何摆脱这个空白。
编辑 1
的Text
列df
如下所示:
ID Text
0 REAL ESTATE LEASE THIS INDUSTRIAL REAL ESTAT...
5 Lease AureementMade and signed on the \ of Aug...
6 FIRST AMENDMENT OF LEASEDATE: August 31, 2001L...
8 [image: image0.jpg] Jack[image: image1.jb2] ...
9 [image: image0.jpg] ABC SALES Meeting 97...
14 FIRST AMENDMENT OF LEASETHIS FIRST AMENDMENT O...
17 [image: image0.tif] Deep ML LEASE SERVI...
22 [image: image0.jpg] F 15 083 EX [image: image1...
26 LEASE AGREEMENT—GROSS LEASEBASIC LEASE PROVISI...
28 [image: image0.jpg] 17. Medical VERIFICATION...
31 [image: image0.jpg] [image: image1.jb2] PLL 3...
32 SUBLEASETHIS SUBLEASE this “Sublease” made as ...
34 [image: image0.tif] Lease Agreement May 10, 20...
35 13057968.3 1 Initials: _____ _____ SECOND ...
42 [image: image0.jpg] Jack Dowson Buy Real MI...
46 Deep – Machine Learning LEASE B...
我想看看
ID Text
0 REAL ESTATE LEASE THIS INDUSTRIAL REAL ESTAT...
5 Lease AureementMade and signed on the \ of Aug...
6 FIRST AMENDMENT OF LEASEDATE: August 31, 2001L...
8 Jack ...
9 ABC SALES Meeting 97...
14 FIRST AMENDMENT OF LEASETHIS FIRST AMENDMENT O...
17 Deep ML LEASE SERVI...
22 F 15 083 EX ...
26 LEASE AGREEMENT—GROSS LEASEBASIC LEASE PROVISI...
28 17. Medical VERIFICATION...
31 PLL 3...
32 SUBLEASETHIS SUBLEASE this “Sublease” made as ...
34 Lease Agreement May 10, 20...
35 13057968.3 1 Initials: _____ _____ SECOND ...
42 Jack Dowson Buy Real MI...
46 Deep – Machine Learning LEASE B...
解决方案
看起来你需要.str.strip()
前任:
df = pd.DataFrame({"ID": [1,2,3], "Text": ["[image: 123.jpg] This document", "[image: image.jpg] Readers of the article", "The agreement between [image: image.jpg] two parties"]})
df["Text"] = df["Text"].str.replace(r"(\s*\[.*?\]\s*)", " ").str.strip()
print(df)
输出:
0 This document
1 Readers of the article
2 The agreement between two parties
Name: Text, dtype: object
推荐阅读
- r - 从 mlr 中的重采样函数中检索模型
- wordpress - Wordpress 将 woocommerce 类别作为数组获取
- javascript - 使用 JavaScript 使用 HTML 表格个性化表格标题、创建分页、排序和过滤
- libgdx - 使用 ligGDX 项目生成器时出错
- python-3.x - 编译 python pyo 文件
- android - 在kotlin的自定义类型arraylist中按id搜索对象
- spring - 如何制作参数化作业?
- java - Java Runtime.getRuntime().exec 不能在命令中使用双引号
- java - 在 JPanel 中填充 JButton
- javascript - Sublime Text 3 构建问题 Haml/jQuery $ is not defined