首页 > 解决方案 > 从对象列表创建熊猫数据框

问题描述

我有以下代码:

import numpy as np
import pandas as pd

class my_class:
    def __init__(self, vector, par1, par2, par3):
        self.vector = vector  # this is a 1D numpy object.
        self.label = '({}, {}, {})'.format(par1, par2, par3)

obj1 = my_class(np.array([0, 1, 2, 3]), 4, -8, 7)
obj2 = my_class(np.array([6, 7, 8, 9]), 10, 4, 15)
obj3 = my_class(np.array([-1, -2, 3, -4]), 9, 3, 6)
obj4 = my_class(np.array([-10, -15, -20, 3]), 1, -2, 6)

my_list_of_objects = [obj1, obj2, obj3, obj4]

df = pd.DataFrame([o.__dict__ for o in my_list_of_objects])
print(df)

这是我得到的当前结果:

               vector        label
0        [0, 1, 2, 3]   (4, -8, 7)
1        [6, 7, 8, 9]  (10, 4, 15)
2     [-1, -2, 3, -4]    (9, 3, 6)
3  [-10, -15, -20, 3]   (1, -2, 6)

我想获得以下熊猫数据框:

   (4, -8, 7)   (10, 4, 15)   (9, 3, 6)   (1, -2, 6)
0       0            6           -1          -10
1       1            7           -2          -15
2       2            8            3          -20
3       3            9           -4            3

正如您在类中看到的,属性标签具有需要使用的相应列的名称。

标签: pythonpandasnumpydataframedictionary

解决方案


使用unstack并从中构建一个新的数据框

df_final = pd.DataFrame.from_dict(df.set_index('label').unstack().droplevel(0)
                                                                 .to_dict())

label或者从and构造一个字典vector并将其传递给数据框构造函数

df_final = pd.DataFrame.from_dict(dict(df[['label', 'vector']].to_numpy()))

Out[267]:
   (4, -8, 7)  (10, 4, 15)  (9, 3, 6)  (1, -2, 6)
0           0            6         -1         -10
1           1            7         -2         -15
2           2            8          3         -20
3           3            9         -4           3

推荐阅读