首页 > 解决方案 > pd.read_csv 显示索引和列名但没有值

问题描述

我很困惑:加载 csv 工作正常,即:没有错误和索引 en 列名显示,但我的 DF 中没有值。下载此 csv,将其转换为 Excel,然后将其加载到 Pandas 中,将其转换为 csv (pd.to_csv) 并再次加载,因为 csv 可以正常工作。csv 加载为数据框.... 这个原始 csv 中一定有一些我不明白的东西。事实上,我的“问题”通过所有这些转换得到了解决。但我想了解什么是错的/我学到了什么。

因此,如果有人知道我在这里做错了什么,那就太好了。谢谢!

link = 'https://www.vektis.nl/uploads/Docs%20per%20pagina/Open%20Data%20Bestanden/2018/Vektis%20Open%20Databestand%20Zorgverzekeringswet%202018%20-%20postcode3.csv'
df = pd.read_csv(link)

df.shape
(137099, 1)

df.info() 看起来很奇怪,而 df.describe() 是空的.....

如前所述:将原始 csv 转换为 xlsx,将其加载到 pandas 并转换为 csv 给出一个 df,带有值等。

df2.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 137099 entries, 0 to 137098
Data columns (total 28 columns):
GESLACHT                                  137098 non-null object
LEEFTIJDSKLASSE                           137098 non-null object
POSTCODE_3                                137098 non-null float64
AANTAL_BSN                                137099 non-null int64
AANTAL_VERZEKERDEJAREN                    137099 non-null float64
KOSTEN_MEDISCH_SPECIALISTISCHE_ZORG       137099 non-null float64
KOSTEN_FARMACIE                           137099 non-null float64
KOSTEN_SPECIALISTISCHE_GGZ                137099 non-null float64
KOSTEN_HUISARTS_INSCHRIJFTARIEF           137099 non-null float64
KOSTEN_HUISARTS_CONSULT                   137099 non-null float64
KOSTEN_HUISARTS_MDZ                       137099 non-null float64
KOSTEN_HUISARTS_OVERIG                    137099 non-null float64
KOSTEN_HULPMIDDELEN                       137099 non-null float64
KOSTEN_MONDZORG                           137099 non-null float64
KOSTEN_PARAMEDISCHE_ZORG_FYSIOTHERAPIE    137099 non-null float64
KOSTEN_PARAMEDISCHE_ZORG_OVERIG           137099 non-null float64
KOSTEN_ZIEKENVERVOER_ZITTEND              137099 non-null float64
KOSTEN_ZIEKENVERVOER_LIGGEND              137099 non-null float64
KOSTEN_KRAAMZORG                          137099 non-null float64
KOSTEN_VERLOSKUNDIGE_ZORG                 137099 non-null float64
KOSTEN_GENERALISTISCHE_BASIS_GGZ          137099 non-null float64
KOSTEN_LANGDURIGE_GGZ                     137099 non-null float64
KOSTEN_GRENSOVERSCHRIJDENDE_ZORG          137099 non-null float64
KOSTEN_EERSTELIJNS_ONDERSTEUNING          137099 non-null float64
KOSTEN_GERIATRISCHE_REVALIDATIEZORG       137099 non-null float64
KOSTEN_EERSTELIJNSVERBLIJF                137099 non-null float64
KOSTEN_VERPLEGING_EN_VERZORGING           137099 non-null float64
KOSTEN_OVERIG                             137099 non-null float64
dtypes: float64(25), int64(1), object(2)
memory usage: 29.3+ MB

1

​</p>

标签: pythonpandasdataframecsv

解决方案


在您的情况下,您只需要提供一个分隔符';'

link = 'https://www.vektis.nl/uploads/Docs%20per%20pagina/Open%20Data%20Bestanden/2018/Vektis%20Open%20Databestand%20Zorgverzekeringswet%202018%20-%20postcode3.csv'
df = pd.read_csv(link, sep=';')
print(df)

       GESLACHT LEEFTIJDSKLASSE  POSTCODE_3  ...  KOSTEN_EERSTELIJNSVERBLIJF  KOSTEN_VERPLEGING_EN_VERZORGING  KOSTEN_OVERIG
0           NaN             NaN         NaN  ...                    60376.04                        637668.87      496931.54
1             M               0         0.0  ...                        0.00                        121744.76         890.41
2             M               0       101.0  ...                        0.00                           565.22         154.32
3             M               0       102.0  ...                        0.00                           342.72          77.16
4             M               0       103.0  ...                        0.00                         11192.82        2498.61
...         ...             ...         ...  ...                         ...                              ...            ...
137094        V             90+       995.0  ...                    17126.82                        230642.72           0.00
137095        V             90+       996.0  ...                    15504.98                        133670.79           0.00
137096        V             90+       997.0  ...                     9608.72                        172186.49           0.00
137097        V             90+       998.0  ...                    37083.13                        733906.73        1083.82
137098        V             90+       999.0  ...                    26639.36                         99737.32           0.00

[137099 rows x 28 columns]

推荐阅读