首页 > 解决方案 > 熊猫加载csv文件ValueError

问题描述

该脚本的目的是读取一个 csv 文件并创建一个 pandas 数据框,然后使用 OOP 样式打印前 5 个原始数据。

这是代码:

import pandas as pd
import talib


class Data:
    def __init__(self):
        self.df = pd.DataFrame()
        self.names = ['Date', 'Time', 'Open', 'High', 'Low', 'Close', 'Volume']
        self.file(self)

    def file(self, file):
        df = pd.read_csv(file, names=self.names,
                         parse_dates={'Release Date': ['Date', 'Time']})
        print(df.head())


x = Data()
x.file(file=r"D:\Projects\Project Forex\EURUSD.csv")

这是错误:

Traceback (most recent call last):

  File "C:/Users/Sayed/PycharmProjects/project/Technical Analysis.py", line 15, in <module>
    x = Data()

  File "C:/Users/Sayed/PycharmProjects/project/Technical Analysis.py", line 9, in __init__
    self.file(self)

  File "C:/Users/Sayed/PycharmProjects/project/Technical Analysis.py", line 13, in file
    parse_dates={'Release Date': ['Date', 'Time']})

  File "C:\Users\Sayed\miniconda3\lib\site-packages\pandas\io\parsers.py", line 676, in parser_f
    return _read(filepath_or_buffer, kwds)

  File "C:\Users\Sayed\miniconda3\lib\site-packages\pandas\io\parsers.py", line 431, in _read
    filepath_or_buffer, encoding, compression

  File "C:\Users\Sayed\miniconda3\lib\site-packages\pandas\io\common.py", line 200, in get_filepath_or_buffer

    raise ValueError(msg)
ValueError: Invalid file path or buffer object type: <class '__main__.Data'>

标签: pythonpandasoop

解决方案


罪魁祸首是__init__:的最后一行self.file(self)。当它被调用时__init__self是一个Data对象,而该file方法必须使用包含 csv 文件路径的字符串来调用。

修复很简单:删除该行:

class Data:
    def __init__(self):
        self.df = pd.DataFrame()
        self.names = ['Date', 'Time', 'Open', 'High', 'Low', 'Close', 'Volume']

    def file(self, file):
        ...

但它仍然不一致:self.df初始化为空数据框,这很好,但该file方法不会更新它,而是使用局部变量df(与self.dfPython 不同)。你应该做:

def file(self, file):
    self.df = pd.read_csv(file, names=self.names,
                     parse_dates={'Release Date': ['Date', 'Time']})
    print(self.df.head())

推荐阅读