python lxml does not recognize a child element even though getchildren() shows it exists

Problem description

I have the following XML:

<?xml version='1.0' encoding='UTF-8'?>
<S:Envelope xmlns:S="http://schemas.xmlsoap.org/soap/envelope/">
    <S:Body>
        <ns7:GetStopMonitoringServiceResponse xmlns:ns3="http://www.siri.org.uk/siri" xmlns:ns4="http://www.ifopt.org.uk/acsb" xmlns:ns5="http://www.ifopt.org.uk/ifopt" xmlns:ns6="http://datex2.eu/schema/1_0/1_0" xmlns:ns7="http://new.webservice.namespace">
            <Answer>
                <ns3:ResponseTimestamp>2019-03-31T09:00:52.912+03:00</ns3:ResponseTimestamp>
                <ns3:ProducerRef>ISR Siri Server (141.10)</ns3:ProducerRef>
                <ns3:ResponseMessageIdentifier>276480603</ns3:ResponseMessageIdentifier>
                <ns3:RequestMessageRef>0100700:1351669188:4684</ns3:RequestMessageRef>
                <ns3:Status>true</ns3:Status>
                <ns3:StopMonitoringDelivery version="IL2.71">
                    <ns3:ResponseTimestamp>2019-03-31T09:00:52.912+03:00</ns3:ResponseTimestamp>
                    <ns3:Status>true</ns3:Status>
                    <ns3:MonitoredStopVisit>
                        <ns3:RecordedAtTime>2019-03-31T09:00:52.000+03:00</ns3:RecordedAtTime>
                        <ns3:ItemIdentifier>-881202701</ns3:ItemIdentifier>
                        <ns3:MonitoringRef>20902</ns3:MonitoringRef>
                        <ns3:MonitoredVehicleJourney>
                            <ns3:LineRef>23925</ns3:LineRef>
                            <ns3:DirectionRef>2</ns3:DirectionRef>
                            <ns3:FramedVehicleJourneyRef>
                                <ns3:DataFrameRef>2019-03-31</ns3:DataFrameRef>
                                <ns3:DatedVehicleJourneyRef>36962685</ns3:DatedVehicleJourneyRef>
                            </ns3:FramedVehicleJourneyRef>
                            <ns3:PublishedLineName>15</ns3:PublishedLineName>
                            <ns3:OperatorRef>15</ns3:OperatorRef>
                            <ns3:DestinationRef>26020</ns3:DestinationRef>
                            <ns3:OriginAimedDepartureTime>2019-03-31T08:35:00.000+03:00</ns3:OriginAimedDepartureTime>
                            <ns3:VehicleLocation>
                                <ns3:Longitude>34.78000259399414</ns3:Longitude>
                                <ns3:Latitude>32.042293548583984</ns3:Latitude>
                            </ns3:VehicleLocation>
                            <ns3:VehicleRef>37629301</ns3:VehicleRef>
                            <ns3:MonitoredCall>
                                <ns3:StopPointRef>20902</ns3:StopPointRef>
                                <ns3:ExpectedArrivalTime>2019-03-31T09:03:00.000+03:00</ns3:ExpectedArrivalTime>
                            </ns3:MonitoredCall>
                        </ns3:MonitoredVehicleJourney>
                    </ns3:MonitoredStopVisit>
                </ns3:StopMonitoringDelivery>
            </Answer>
        </ns7:GetStopMonitoringServiceResponse>
    </S:Body>
</S:Envelope>

I use lxml objectify to convert the XML into an object, and then try to access its children as described in the documentation.

This is my loading code:

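from lxml import objectify

# xml_content holds the XML above; pass bytes, since lxml rejects str input that has an encoding declaration
obj = objectify.fromstring(xml_content)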

The following code works fine:

print(obj.Body.tag)

{http://schemas.xmlsoap.org/soap/envelope/}Body

But accessing the child of Body (GetStopMonitoringServiceResponse) raises an error:

print(obj.Body.GetStopMonitoringServiceResponse.tag)

AttributeError: no such child: {http://schemas.xmlsoap.org/soap/envelope/}GetStopMonitoringServiceResponse

Yet when I list the children of Body, I do see this element:

print(obj.Body.getchildren())

[<Element {http://new.webservice.namespace}GetStopMonitoringServiceResponse at 0x1d6e17f0908>]

What am I missing here?

Tags: python, lxml, lxml.objectify

Solution


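Attribute access in objectify looks up a child in the namespace of its parent. That is why the error message names {http://schemas.xmlsoap.org/soap/envelope/}GetStopMonitoringServiceResponse, while the actual element lives in http://new.webservice.namespace. To access an element bound to a different namespace than its parent, you can use getattr() with the namespace-qualified tag: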

g = getattr(obj.Body, "{http://new.webservice.namespace}GetStopMonitoringServiceResponse")
print(g.tag)

The following also works:

g = obj.Body["{http://new.webservice.namespace}GetStopMonitoringServiceResponse"]
print(g.tag)
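
To try both forms without the full response, here is a minimal self-contained sketch. It assumes a trimmed-down copy of the XML from the question (only one Status element is kept), and the names SERVICE_NS, qualified_name, via_getattr and via_item are chosen for illustration only:

```python
from lxml import objectify

SERVICE_NS = "http://new.webservice.namespace"

# Trimmed-down copy of the response from the question. Bytes input is used
# because lxml rejects str input that carries an encoding declaration.
xml_content = b"""<?xml version='1.0' encoding='UTF-8'?>
<S:Envelope xmlns:S="http://schemas.xmlsoap.org/soap/envelope/">
    <S:Body>
        <ns7:GetStopMonitoringServiceResponse xmlns:ns3="http://www.siri.org.uk/siri" xmlns:ns7="http://new.webservice.namespace">
            <Answer>
                <ns3:Status>true</ns3:Status>
            </Answer>
        </ns7:GetStopMonitoringServiceResponse>
    </S:Body>
</S:Envelope>"""

obj = objectify.fromstring(xml_content)
qualified_name = "{%s}GetStopMonitoringServiceResponse" % SERVICE_NS

# Plain attribute access searches the parent's (SOAP) namespace, so it fails.
try:
    obj.Body.GetStopMonitoringServiceResponse
    plain_lookup_failed = False
except AttributeError as exc:
    plain_lookup_failed = True
    print("plain lookup failed:", exc)

# Both forms below take the namespace-qualified tag and find the child.
via_getattr = getattr(obj.Body, qualified_name)
via_item = obj.Body[qualified_name]
print(via_getattr.tag)
print(via_item.tag)
```

Both lookups should print the same qualified tag that getchildren() reported in the question.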

Reference: https://lxml.de/objectify.html#namespace-handling
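
For elements deeper in the tree, qualifying every step with getattr() gets verbose. One alternative, which is not part of the answer above, is the regular find() method that objectified elements inherit from lxml.etree, combined with a prefix map. The sketch below assumes a trimmed-down copy of the question's XML, and the prefix "siri" is an arbitrary name chosen here (it does not have to match ns3 in the document):

```python
from lxml import objectify

SIRI_NS = "http://www.siri.org.uk/siri"

# Trimmed-down copy of the response from the question (bytes input).
xml_content = b"""<?xml version='1.0' encoding='UTF-8'?>
<S:Envelope xmlns:S="http://schemas.xmlsoap.org/soap/envelope/">
    <S:Body>
        <ns7:GetStopMonitoringServiceResponse xmlns:ns3="http://www.siri.org.uk/siri" xmlns:ns7="http://new.webservice.namespace">
            <Answer>
                <ns3:StopMonitoringDelivery version="IL2.71">
                    <ns3:MonitoredStopVisit>
                        <ns3:MonitoredVehicleJourney>
                            <ns3:MonitoredCall>
                                <ns3:StopPointRef>20902</ns3:StopPointRef>
                                <ns3:ExpectedArrivalTime>2019-03-31T09:03:00.000+03:00</ns3:ExpectedArrivalTime>
                            </ns3:MonitoredCall>
                        </ns3:MonitoredVehicleJourney>
                    </ns3:MonitoredStopVisit>
                </ns3:StopMonitoringDelivery>
            </Answer>
        </ns7:GetStopMonitoringServiceResponse>
    </S:Body>
</S:Envelope>"""

obj = objectify.fromstring(xml_content)

# Map a prefix of our own choosing to the SIRI namespace URI.
namespaces = {"siri": SIRI_NS}

# ".//" searches all descendants, so no intermediate element is spelled out.
arrival = obj.find(".//siri:ExpectedArrivalTime", namespaces=namespaces)
print(arrival.text)
```

find() returns only the first match. For a response with several MonitoredStopVisit entries, findall() with the same path and prefix map would return all of them.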

