首页 > 解决方案 > 从无标题的固定格式文件中读取数据

问题描述

使用 pandas 如何创建下面提到的数据框

输入文件

01 Jeff Thomoson   000 11-28-2018 Payments  2,400    Wire transfer
01 Jeff Thomoson   000 11-29-2018 Interest    100    account
01 Paul Simson     000 11-12-2018 Payments  1,000    Wire transfer
01 Paul Simson     000 11-18-2018 Payments    140    net banking
01 John Sans       000 11-28-2018 Payments    300    cheque

输出文件

TR  USER NAME    TR Mode  Date       Narration Amt   Mode
01 Jeff Thomoson 000      11-28-2018 Payments  2,400 Wire transfer
01 Jeff Thomoson 000      11-29-2018 Interest  100   account
01 Paul Simson   000      11-12-2018 Payments  1000  Wire transfer
01 Paul Simson   000      11-18-2018 Payments  140   Net banking
01 John Sans     000      11-28-2018 Payments  300   cheque

标签: pythonpandas

解决方案


戴兹有。您需要使用的是pd.read_fwf. 对于您的示例数据,这是开箱即用的:

names = ['TR', 'USER NAME', 'TR Mode', 'Date', 'Narration', 'Amt', 'Mode']
pd.read_fwf(file, header=None, names=names)

     TR USER NAME  TR Mode        Date Narration    Amt           Mode
1  Jeff  Thomoson        0  11-28-2018  Payments  2,400  Wire transfer
1  Jeff  Thomoson        0  11-29-2018  Interest    100        account
1  Paul    Simson        0  11-12-2018  Payments  1,000  Wire transfer
1  Paul    Simson        0  11-18-2018  Payments    140    net banking
1  John      Sans        0  11-28-2018  Payments    300         cheque

如果您仍然有解析错误,您可能需要稍微修改一下宽度。


推荐阅读