我通常使用標準庫中的ElementTree模塊解析XML。
它沒有給你一個字典,你得到了一個更有用的DOM結構,它允許你為孩子們遍歷每個元素。from xml.etree import ElementTree as ET
xml = ET.parse("
root_element = xml.getroot()
for child in root_element:
...
如果需要將其解析為字典,而不是從DOM樹中獲取所需的信息,則從根節點構建字典的遞歸函數如下:def xml_dict(node, path="", dic =None):
if dic == None:
dic = {}
name_prefix = path + ("." if path else "") + node.tag
numbers = set()
for similar_name in dic.keys():
if similar_name.startswith(name_prefix):
numbers.add(int (similar_name[len(name_prefix):].split(".")[0] ) )
if not numbers:
numbers.add(0)
index = max(numbers) + 1
name = name_prefix + str(index)
dic[name] = node.text + "<...>".join(childnode.tail
if childnode.tail is not None else
"" for childnode in node)
for childnode in node:
xml_dict(childnode, name, dic)
return dic
對于上面列出的XML,將生成此字典:{'A1': '\n \n <...>\n',
'A1.B1': '\n \n <...>\n ',
'A1.B1.C1': '"blah"',
'A1.B1.C2': '"blah"',
'A1.B2': '\n \n <...>\n ',
'A1.B2.C1': '"blah"',
'A1.B2.C2': '"blah"'}
(我發現DOM表單更有用)