首页 > 解决方案 > 在 csv 字典/嵌套 csv 字典的帮助下使用 Python 编写 CSV 文件

问题描述

我有一个 csv 文件,我想将它写入另一个 csv 文件。这比看起来要复杂一些。希望有人更正我的代码并重写它,以便我可以获得所需的 csvfile。我正在使用 python 2 和 3 两个版本。

mycsvfile:

id,field1,point_x,point_y,point_z
a1,x1,10,12,3
b1,x2,20,22,5
a2,x1,25,17,7
a1,x2,35,13,3
a1,x5,15,19,9
b1,x1,65,11,2
b2,x5,50,23,1
b2,x1,75,17,7
c1,x2,70,87,2
c2,x1,80,67,4
c3,x2,85,51,6

图:mycsvfile

我的代码:

import os
import csv
import collections
from csv import DictWriter    

with open(r'C:\Users\Desktop\kar_csv_test\workfiles\incsv.csv', 'r') as csvfile:
    reader = csv.reader(csvfile, delimiter=',')
    my_dict = collections.defaultdict(dict)
    for row in reader:
        my_dict[row[0]][row[1]] = [row[2],row[3],row[4]]

print (my_dict)


with open(r'C:\Users\Desktop\kar_csv_test\workfiles\outcsv.csv','w', newline='') as wf:
    fieldnames = ['id', 'x1(point_x)', 'x1(point_y)', 'x1(point_z)', 'x2(point_x)', 'x2(point_y)', 'x2(point_z)'] # >>>>>>etc, till x20(point_x), x20(point_y), x20(point_z)
    my_write = csv.DictWriter(wf, fieldnames = fieldnames, delimiter = ',')
    my_write.writeheader()






Desired output as csv file:

id,x1(point_x),x1(point_y),x1(point_z),x2(point_x),x2(point_y),x2(point_z)       
a1,10,12,3,35,13,3,
a2,25,17,7,,,,
b1,65,11,2,20,22,5,
b2,75,17,7,,,,
c1,,,,70,87,2,
c2,80,67,4,,,,
c3,,,,85,51,6,

图:Desiredcsvfile

标签: pythonpandascsvdictionary

解决方案


此答案仅适用于 Python3。csv 模块在 Python2 和 Python3 之间有一个非常不同的接口,编写兼容的代码超出了我准备在这里做的事情。

在这里,我将计算fieldnames列表,并以相同的模式计算每一行:

...
with open(r'C:\Users\Desktop\kar_csv_test\workfiles\outcsv.csv','w', newline='') as wf:
    fieldnames = ['id'] + ['x{}(point_{})'.format(i, j)
                           for i in range(1, 6) for j in "xyz"] # only up to x5 here
    my_write = csv.DictWriter(wf, fieldnames = fieldnames, delimiter = ',')
    my_write.writeheader()
    for k, v in my_dict.items():
        row = {'x{}(point_{})'.format(i, k):
               v.get('x{}'.format(i), ('','',''))[j]   # get allows to get a blank triple is absent
               for i in range(1,6) for j,k in enumerate("xyz")}
        row['id'] = k                                  # do not forget the id...
        my_write.writerow(row)

使用您的示例数据,它给出:

id,x1(point_x),x1(point_y),x1(point_z),x2(point_x),x2(point_y),x2(point_z),x3(point_x),x3(point_y),x3(point_z),x4(point_x),x4(point_y),x4(point_z),x5(point_x),x5(point_y),x5(point_z)
a1,10,12,3,35,13,3,,,,,,,15,19,9
b1,65,11,2,20,22,5,,,,,,,,,
a2,25,17,7,,,,,,,,,,,,
b2,75,17,7,,,,,,,,,,50,23,1
c1,,,,70,87,2,,,,,,,,,
c2,80,67,4,,,,,,,,,,,,
c3,,,,85,51,6,,,,,,,,,

推荐阅读