首页 > 解决方案 > 如何通过指定列名获取所需的列数据

问题描述

我有以下代码,

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from scipy import stats
dataFileName='RFInput.xlsx'
sheetName='Rawdata'
sheetNamePara='paraList'
dataRaw=pd.read_excel(dataFileName, sheetname = sheetName)
datapara=pd.read_excel(dataFileName, sheetname = sheetNamePara)

noData=len(dataRaw)
import matplotlib.pylab as plt
from sklearn.cross_validation import train_test_split
from sklearn.cross_validation import cross_val_score
from sklearn.preprocessing import StandardScaler


labels = datapara
x = dataRaw[labels]
y = dataRaw['classVariable']

我所需的列名在 datapara 中指定,但它读取所有 vales 为 NaN

标签: pythonpandasdataframe

解决方案


目前,labels参考数据框。要选择多个列,您需要在标签中指定一个列表,如下所示:

labels = datapara.columns
x = dataRaw[labels]

推荐阅读