Converting multiple lines into a single line in Unix

Problem description

I have a file in which one column contains multi-line data, and I want to convert each multi-line record into a single line.

Here is a sample, including the header row:

final_date|Notes|Status
04/17/2019|"- OB Team - 
Number of Attempt(s): 1
Outcome:other
Order (RMO):0
Campaign : ABC
Additional Notes:  not a working number  
* If any call return to transfer to OB team *"|Complete
04/18/2019|"- OB Team - 

Number of Attempt(s): 3
Outcome: NO ANSWER
Order (RMO): 0
Campaign Name: ABC

*If return call, transfer to OB team* 


- OB TEAM - 
Number of Attempt(s):  1 
Outcome:  VM
Order (RMO):  0
Campaign Name:  ABC 
Additional Notes: None
*If return call, transfer to OB team*"|Complete

The data above contains two records. I want each record converted to a single line so that it can then be loaded into a Hive table.

The data above should be converted as follows:

final_date|Notes|Status
04/17/2019|"- OB Team - Number of Attempt(s): 1 Outcome:other Order (RMO):0 Campaign : ABC Additional Notes:  not a working number * If any call return to transfer to OB team *"|Complete
04/18/2019|"- OB Team - Number of Attempt(s): 3 Outcome: NO ANSWER Order (RMO): 0 Campaign Name: ABC *If return call, transfer to OB team*  - OB TEAM - Number of Attempt(s):  1  Outcome:  VM Order (RMO):  0 Campaign Name:  ABC Additional Notes: None *If return call, transfer to OB team*"|Complete

Can anyone help me solve this?

Tags: shell, csv, unix, hive, multiline

Solution


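Switch the output record separator (ORS) based on the number of double quotes in the current line. A line with an odd number of quotes opens or closes a quoted field, so ORS toggles between a newline and a space at each such line.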

awk -F\" 'BEGIN{ors=ORS} NF&&!(NF%2){ORS=(ORS!=ors)?ors:OFS} 1' file
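For readers who find the one-liner terse, here is a minimal expanded sketch of the same idea with comments. The function name join_quoted_lines is a hypothetical helper introduced only for illustration, and the sketch assumes that double quotes appear only as field delimiters (or as doubled "" escapes, which leave the odd/even count unchanged).

```shell
#!/bin/sh
# join_quoted_lines (hypothetical helper name): reads records on stdin and
# joins the physical lines of a quoted multi-line field with a single space.
join_quoted_lines() {
    awk -F'"' '
        BEGIN { newline = ORS }   # remember the default separator ("\n")

        # With " as the field separator, NF is (number of quotes + 1).
        # An even NF therefore means an odd number of quotes on this line:
        # a quoted field is being opened or closed, so flip the separator.
        # The "NF &&" guard skips empty lines, where NF is 0.
        NF && NF % 2 == 0 {
            ORS = (ORS != newline) ? newline : " "
        }

        { print }                 # emit the line followed by the current ORS
    '
}
```

It could be used as join_quoted_lines < file > file.flat, where file.flat is an arbitrary output name. Trailing spaces and blank lines inside a quoted field are kept as extra spaces, so the result may contain runs of spaces where the expected output above shows a single one.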
