python - 使用键作为列将字典列表转换为 Pandas DataFrame
问题描述
我在下面有一个市场数据字典列表,如何使用其键作为列/将其转换为 Pandas DataFrame
[
'{"level":0,"side":"Ask","price":"13745.75000","volume":"2"}',
'{"level":1,"side":"Ask","price":"13745.50000","volume":"5"}',
'{"level":2,"side":"Ask","price":"13745.25000","volume":"6"}',
'{"level":3,"side":"Ask","price":"13745.00000","volume":"15"}',
'{"level":4,"side":"Ask","price":"13744.75000","volume":"4"}',
'{"level":5,"side":"Ask","price":"13744.50000","volume":"5"}',
'{"level":6,"side":"Ask","price":"13744.25000","volume":"3"}',
'{"level":7,"side":"Ask","price":"13744.00000","volume":"3"}',
'{"level":8,"side":"Ask","price":"13743.75000","volume":"2"}',
'{"level":9,"side":"Ask","price":"13743.50000","volume":"1"}',
'{"level":10,"side":"Bid","price":"13743.00000","volume":"2"}',
'{"level":11,"side":"Bid","price":"13742.75000","volume":"4"}',
'{"level":12,"side":"Bid","price":"13742.50000","volume":"3"}',
'{"level":13,"side":"Bid","price":"13742.25000","volume":"4"}',
'{"level":14,"side":"Bid","price":"13742.00000","volume":"4"}',
'{"level":15,"side":"Bid","price":"13741.75000","volume":"4"}',
'{"level":16,"side":"Bid","price":"13741.50000","volume":"4"}',
'{"level":17,"side":"Bid","price":"13741.25000","volume":"4"}',
'{"level":18,"side":"Bid","price":"13741.00000","volume":"4"}',
'{"level":19,"side":"Bid","price":"13740.75000","volume":"2"}',
]
解决方案
你有一个字典的字符串表示列表。如果您可以将其转换为字典列表,则转换为 pandas 数据框会更容易。
为此,您可以使用ast.literal_eval
比eval
这样简单的更安全的
from ast import literal_eval
data = ['{"level":0,"side":"Ask","price":"13745.75000","volume":"2"}', '{"level":1,"side":"Ask","price":"13745.50000","volume":"5"}', '{"level":2,"side":"Ask","price":"13745.25000","volume":"6"}', '{"level":3,"side":"Ask","price":"13745.00000","volume":"15"}', '{"level":4,"side":"Ask","price":"13744.75000","volume":"4"}', '{"level":5,"side":"Ask","price":"13744.50000","volume":"5"}', '{"level":6,"side":"Ask","price":"13744.25000","volume":"3"}', '{"level":7,"side":"Ask","price":"13744.00000","volume":"3"}', '{"level":8,"side":"Ask","price":"13743.75000","volume":"2"}', '{"level":9,"side":"Ask","price":"13743.50000","volume":"1"}', '{"level":10,"side":"Bid","price":"13743.00000","volume":"2"}', '{"level":11,"side":"Bid","price":"13742.75000","volume":"4"}', '{"level":12,"side":"Bid","price":"13742.50000","volume":"3"}', '{"level":13,"side":"Bid","price":"13742.25000","volume":"4"}', '{"level":14,"side":"Bid","price":"13742.00000","volume":"4"}', '{"level":15,"side":"Bid","price":"13741.75000","volume":"4"}', '{"level":16,"side":"Bid","price":"13741.50000","volume":"4"}', '{"level":17,"side":"Bid","price":"13741.25000","volume":"4"}', '{"level":18,"side":"Bid","price":"13741.00000","volume":"4"}', '{"level":19,"side":"Bid","price":"13740.75000","volume":"2"}']
data_new = [literal_eval(i) for i in data]
输出将如下所示,
[{'level': 0, 'side': 'Ask', 'price': '13745.75000', 'volume': '2'},
{'level': 1, 'side': 'Ask', 'price': '13745.50000', 'volume': '5'},
{'level': 2, 'side': 'Ask', 'price': '13745.25000', 'volume': '6'},
{'level': 3, 'side': 'Ask', 'price': '13745.00000', 'volume': '15'},
{'level': 4, 'side': 'Ask', 'price': '13744.75000', 'volume': '4'},
{'level': 5, 'side': 'Ask', 'price': '13744.50000', 'volume': '5'},
{'level': 6, 'side': 'Ask', 'price': '13744.25000', 'volume': '3'},
{'level': 7, 'side': 'Ask', 'price': '13744.00000', 'volume': '3'},
{'level': 8, 'side': 'Ask', 'price': '13743.75000', 'volume': '2'},
{'level': 9, 'side': 'Ask', 'price': '13743.50000', 'volume': '1'},
{'level': 10, 'side': 'Bid', 'price': '13743.00000', 'volume': '2'},
{'level': 11, 'side': 'Bid', 'price': '13742.75000', 'volume': '4'},
{'level': 12, 'side': 'Bid', 'price': '13742.50000', 'volume': '3'},
{'level': 13, 'side': 'Bid', 'price': '13742.25000', 'volume': '4'},
{'level': 14, 'side': 'Bid', 'price': '13742.00000', 'volume': '4'},
{'level': 15, 'side': 'Bid', 'price': '13741.75000', 'volume': '4'},
{'level': 16, 'side': 'Bid', 'price': '13741.50000', 'volume': '4'},
{'level': 17, 'side': 'Bid', 'price': '13741.25000', 'volume': '4'},
{'level': 18, 'side': 'Bid', 'price': '13741.00000', 'volume': '4'},
{'level': 19, 'side': 'Bid', 'price': '13740.75000', 'volume': '2'}]
如您所见,它是一个字典列表。Pandas 可以轻松地处理它。要转换为 pandas 数据框,请执行此操作,
df = pd.DataFrame(data_new)
level side price volume
0 0 Ask 13745.75000 2
1 1 Ask 13745.50000 5
2 2 Ask 13745.25000 6
3 3 Ask 13745.00000 15
4 4 Ask 13744.75000 4
5 5 Ask 13744.50000 5
6 6 Ask 13744.25000 3
7 7 Ask 13744.00000 3
8 8 Ask 13743.75000 2
9 9 Ask 13743.50000 1
10 10 Bid 13743.00000 2
11 11 Bid 13742.75000 4
12 12 Bid 13742.50000 3
13 13 Bid 13742.25000 4
14 14 Bid 13742.00000 4
15 15 Bid 13741.75000 4
16 16 Bid 13741.50000 4
17 17 Bid 13741.25000 4
18 18 Bid 13741.00000 4
19 19 Bid 13740.75000 2
如果需要,您也可以将级别设置为索引,
df = pd.DataFrame(df_new).set_index("level")
推荐阅读
- python-3.x - Py + SeleniumWebdriver - NoSuchElementException not working
- reactjs - 使用 JHipster 和 React 的 adyen 电子商务演示问题
- google-sheets - Google Sheets VALUE 函数给出错误的结果
- docker - 错误 [内部] 加载 docker.io/library/python.alpine3.8:latest 的元数据
- windows - 从 Windows 任务栏快捷方式启动的 PuTTY 会话窗口没有获得焦点
- javascript - 异步确实等待数据在 redux-thunk 函数中返回
- pyspark - 在 PySpark 中创建数据框时处理不同的 JSON 模式
- python - Nested list manipulation in Python with irregular columns
- python - 两个python解释器之间的区别
- javascript - 在 JavaScript 中反转链接列表会覆盖 this.head 并且不会更新