首页 > 解决方案 > Python:Pandas 在读取表格文本文件时不分隔列

问题描述

我有一个这样的文本文件:

PERSONAL INFORMATION              

 First Name:             Michael
 Last Name:              Junior
 Birth Date:             May 17, 1999 
 Location:               Whitehurst Hall 301. City: Stillwater. State: OK
                         Taken on July 8, 2000 10:50:30 AM MST

WORK EXPERIENCE
Work type select                                        Part-time
ID number                                               10124
Company name                                            ABCDFG Inc.
Positions                                               Software Engineer/Research Scientist
                                                        Data Analyst/Scientist

如您所见,第一列是特征名称,第二列是值。我使用以下代码阅读它:

import pandas as pd
import numpy as np
import scipy as sp


df=pd.read_table('personal.txt',skiprows=1)
pd.set_option('display.max_colwidth',10000)
pd.set_option('display.max_rows',1000)
df

但它合并列和输出:

    PERSONAL INFORMATION
0   First Name: Michael
1   Last Name: Junior
2   Birth Date: May 17, 1999
3   Location: Whitehurst Hall 301. City: Stillwater. State: OK
4   Taken on July 8, 2000 10:50:30 AM MST
5   WORK EXPERIENCE
6   Work type select Part-time
7   ID number 10124
8   Company name Google Inc.
9   Positions Software Engineer/Research Scientist
10  Data Analyst/Scientist

我也应该摆脱那些标题个人信息和工作经验。我怎样才能以一种在两列中给出适当结果的方式阅读?

标签: pythonpandasdataframemultiple-columns

解决方案


推荐阅读