dataweave - 从 XML 映射到 CSV 循环遍历由下划线分隔的值
问题描述
<tXML>
<Header>
<Source>J1_RETAIL</Source>
<Action_Type>Update</Action_Type>
<Sequence_Number>0</Sequence_Number>
<Batch_ID>4383352</Batch_ID>
<Reference_ID>04381645</Reference_ID>
<User_ID>SAP</User_ID>
<Password>password</Password>
<Message_Type>SAP_DO</Message_Type>
<Company_ID>J1</Company_ID>
<Msg_Locale>English (United States)</Msg_Locale>
<Msg_Time_Zone>Eastern Standard Time</Msg_Time_Zone>
<Version></Version>
<Internal_Reference_ID></Internal_Reference_ID>
<Internal_Date_Time_Stamp></Internal_Date_Time_Stamp>
<External_Reference_ID></External_Reference_ID>
<External_Date_Time_Stamp></External_Date_Time_Stamp>
</Header>
<Message>
<DistributionOrder>
<ProcessInfo>
<RefTextField1></RefTextField1>
<RefTextField2></RefTextField2>
<RefTextField3>S082</RefTextField3>
<RefTextField4></RefTextField4>
<RefTextField5></RefTextField5>
<RefTextField6></RefTextField6>
<RefTextField7>J1</RefTextField7>
<RefTextField8>[0001333006_SAPTOMIF]</RefTextField8>
<RefTextField9></RefTextField9>
<RefTextField10>[ _20191223]</RefTextField10>
<RefNumberField1>20191220</RefNumberField1>
<RefNumberField2>34621</RefNumberField2>
<RefNumberField3></RefNumberField3>
<RefNumberField4>53</RefNumberField4>
<RefNumberField5>13</RefNumberField5>
</ProcessInfo>
<Comment>
<NoteType>MB</NoteType>
<NoteCode>05</NoteCode>
<CommentText>[00000_ _ _ _ _ _ _ ]</CommentText>
<Visibility>0</Visibility>
</Comment>
<CustomFieldList>
<CustomField>
<Name>SiteID</Name>
<Value></Value>
</CustomField>
<LineItem>
<DoLineNbr>1</DoLineNbr>
<ItemName>135465</ItemName>
<Description>A</Description>
<UpdateActionType></UpdateActionType>
<PackageType></PackageType>
<DoLineStatus>Released</DoLineStatus>
<InventoryAttributes>
<InventoryType>F</InventoryType>
<ProductStatus></ProductStatus>
<BatchNbr></BatchNbr>
<CountryOfOrigin></CountryOfOrigin>
<ItemAttribute1>R</ItemAttribute1>
<ItemAttribute2></ItemAttribute2>
<ItemAttribute3></ItemAttribute3>
<ItemAttribute4></ItemAttribute4>
<ItemAttribute5></ItemAttribute5>
</InventoryAttributes>
<Comment>
<NoteType>MB</NoteType>
<NoteCode>20</NoteCode>
<CommentText>[JPY_ _Mens_ _ _ _ _ _ _ _ _ _ _ ]</CommentText>
<Visibility>0</Visibility>
</Comment>
<Comment>
<NoteType>MB</NoteType>
<NoteCode>13</NoteCode>
<CommentText>[ _00016000.00000_.00000_.00000_ _ _ _ _ _ _ _ ]</CommentText>
<Visibility>0</Visibility>
</Comment>
<Comment>
<NoteType>SC</NoteType>
<NoteCode>02</NoteCode>
<CommentText>[ _ _ _ _R]</CommentText>
<Visibility>0</Visibility>
</Comment>
</LineItem>
</DistributionOrder>
</Message>
</tXML>
我在 XMl 上方映射到 CSV 管道分隔格式,需要有关如何循环和打印 CommentText 值的帮助。
我的代码看起来像这样
output application/csv header = false , separator = "|" , quoteValues = false
---
payload.tXML.Message.*DistributionOrder map ((DistributionOrder , indexofDistributionOrder) -> {
column_1: "000000001",
column_2: "0",
column_3: "0",
column_4: (LineItem.LineProcessInfo.LineRefTextField2 splitBy "_")[0] replace "[" with "" default "",
column_5: (LineItem.LineProcessInfo.LineRefTextField2 splitBy "_")[1] replace "[" with "" default "",
column_6: ????
column_7: "",
column_8: "",
column_9: "",
column_10: "",
})
在上面的代码中,我需要检查条件if NoteType = MB and NoteCode =05
,我必须遍历每个注释文本并在 DistributionOrder 下获取值,
[00000_ 1_2_3_4_5_6_7]
我需要将每个值(例如 1、2、3...7)放在每一行中。
00000001|0001333006|20191220|34621|SAPTOMIF4|CommentText1|||6500055464123|252||J1||||MW09449|
00000001|0001333006|20191220|34621|SAPTOMIF4|CommentText2|||6500055464123|252||J1||||MW09449|
00000001|0001333006|20191220|34621|SAPTOMIF4|CommentText3|||6500055464123|252||J1||||MW09449|
00000001|0001333006|20191220|34621|SAPTOMIF6|CommentText4|||6500055654123|4542||J1||||MW09449|
00000001|0001333006|20191220|34621|SAPTOMIF6|CommentText5|||6500055654123|4542||J1||||MW09449|
00000001|0001333006|20191220|34621|SAPTOMIF6|CommentText6|||6500055654123|4542||J1||||MW09449|
00000001|0001333006|20191220|34621|SAPTOMIF6|CommentText7|||6500055654123|4542||J1||||MW09449|
解决方案
下面的代码将使用 flatMap(而不是 map 和 flatten)和 splitBy 的组合
%dw 2.0
output application/csv header = false , separator = "|" , quoteValues = false
var payload = read("<tXML>
<Message>
<DistributionOrder>
<LineItem>
<Comment>
<NoteType>MB</NoteType>
<NoteCode>20</NoteCode>
<CommentText>[JPY_ _Mens_ _ _ _ _ _ _ _ _ _ _ ]</CommentText>
<Visibility>0</Visibility>
</Comment>
<Comment>
<NoteType>MB</NoteType>
<NoteCode>13</NoteCode>
<CommentText>[ _00016000.00000_.00000_.00000_ _ _ _ _ _ _ _ ]</CommentText>
<Visibility>0</Visibility>
</Comment>
</LineItem>
<LineItem>
<Comment>
<NoteType>MB</NoteType>
<NoteCode>20</NoteCode>
<CommentText>[JPY_ _Mens_ _ _ _ _ _ _ _ _ _ _ ]</CommentText>
<Visibility>0</Visibility>
</Comment>
<Comment>
<NoteType>MB</NoteType>
<NoteCode>13</NoteCode>
<CommentText>[ _00016000.00000_.00000_.00000_ _ _ _ _ _ _ _ ]</CommentText>
<Visibility>0</Visibility>
</Comment>
<Comment>
<NoteType>SC</NoteType>
<NoteCode>02</NoteCode>
<CommentText>[ _ _ _ _R]</CommentText>
<Visibility>0</Visibility>
</Comment>
<Comment>
<NoteType>MB</NoteType>
<NoteCode>05</NoteCode>
<CommentText>1_2_3_4_ _6</CommentText>
<Visibility>0</Visibility>
</Comment>
</LineItem>
</DistributionOrder>
</Message>
</tXML>", "application/xml")
fun outputMap (LineItem, column6="") =
{
column_1: "000000001",
column_2: "0",
column_3: "0",
column_4: (LineItem.LineProcessInfo.LineRefTextField2 splitBy "_")[0] replace "[" with "" default "",
column_5: (LineItem.LineProcessInfo.LineRefTextField2 splitBy "_")[1] replace "[" with "" default "",
column_6: column6 default "",
column_7: "",
column_8: "",
column_9: "",
column_10: ""
}
---
payload.tXML.Message.DistributionOrder.*LineItem flatMap (LineItem) ->
using (filteredComment = LineItem.*Comment[?($.NoteType == "MB" and $.NoteCode == "05")])
if (filteredComment != null)
filteredComment flatMap(commentObject) ->
using (splitCommentText = commentObject.CommentText splitBy /_/)
splitCommentText flatMap (commentText) -> outputMap(LineItem, commentText)
else
outputMap(LineItem)
这将导致
000000001|0|0|||||||
000000001|0|0|||1||||
000000001|0|0|||2||||
000000001|0|0|||3||||
000000001|0|0|||4||||
000000001|0|0||| ||||
000000001|0|0|||6||||
请注意,我已更新有效负载以根据给定条件显示评论。上面的代码序列将循环通过 LineItems -> 选择给定条件的注释 -> 循环通过选定的注释 -> 拆分注释文本 -> 遍历结果数组。我在这里添加了一个 if else 语句来检查所选/过滤的评论对象是否存在。
更新:
splitCommentText[?($$>0)] flatMap (commentText) -> outputMap(LineItem, commentText)
添加了过滤器以仅考虑索引 > 0 的值。
推荐阅读
- ansible - ansible 模块语法 - 什么 | 模块名之后是什么意思?
- java - Apache Karaf - CXF REST 服务器地址
- android - 特别是在 android 上的 SQLite 中,如果您更改构建它们的表的架构,是否需要“刷新”视图?
- python - Yield 导致 '...Iterator' 对象不可调用异常
- docker - 运行命令时 Dockerfile 出错-->docker logs test-client
- c++ - 我如何使用 cout<<"hello world"; 不使用;在行尾
- .net - CosmosDb 3.2 - 在没有聚合请求的情况下聚合失败
- powershell - 列出目录的名称和大小
- firebase - 如何在重新加载 Firebase Web 应用程序时获取 oauth 令牌?
- vue.js - Algolia vue-instantsearch 多次触发相同的查询