Python: How do I get the text content of an XML element using xml.dom.minidom?

Question

Asked by mindthief

I've called elems = xmldoc.getElementsByTagName('myTagName') on an XML document that I parsed with minidom.parse(xmlObj). Now I'm trying to get the text content of this element, and although I spent a while looking through the dir() output and trying things out, I haven't found the right call yet. As an example of what I want to accomplish, in:

<myTagName> Hello there </myTagName>

I would like to extract just "Hello there". (Obviously I could parse this myself, but I expect there is some built-in functionality.)

Thanks

Answer 1

Accepted answer by ismail

Try this:

xmldoc.getElementsByTagName('myTagName')[0].firstChild.nodeValue
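
A minimal, self-contained sketch of the one-liner above, assuming the document is parsed from a string with minidom.parseString (the sample XML and the variable names are made up for illustration). Note that nodeValue returns the text exactly as stored, surrounding whitespace included, so strip() is applied here to get just "Hello there". An empty element has no children, so firstChild is None and may need a guard.

```python
from xml.dom import minidom

# Sample document, assumed for illustration only.
xml_source = "<root><myTagName> Hello there </myTagName><myTagName/></root>"
xmldoc = minidom.parseString(xml_source)

elems = xmldoc.getElementsByTagName("myTagName")

# Text of the first matching element, whitespace included.
raw_text = elems[0].firstChild.nodeValue
clean_text = raw_text.strip()

# An empty element has no child nodes, so firstChild is None.
empty_has_child = elems[1].firstChild is not None

print(repr(raw_text))
print(clean_text)
```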

Answer 2

Answer by James Thompson

for elem in elems:
    print(elem.firstChild.nodeValue)

That will print out each myTagName's text.

James

Answer 3

Answer by mike rodent

Wait a moment... do you want ALL the text under a given node? Then it has to involve a subtree traversal function of some kind. It doesn't have to be recursive, but this works fine:

    def get_all_text( node ):
        if node.nodeType ==  node.TEXT_NODE:
            return node.data
        else:
            text_string = ""
            for child_node in node.childNodes:
                text_string += get_all_text( child_node )
            return text_string
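
Since the traversal doesn't have to be recursive, here is a rough sketch of an iterative variant that uses an explicit stack. The function name get_all_text_iterative and the sample document are hypothetical, chosen only for illustration. Like the recursive version, it collects TEXT_NODE data only.

```python
from xml.dom import minidom


def get_all_text_iterative(node):
    """Collect the text under `node` in document order without recursion."""
    parts = []
    stack = [node]
    while stack:
        current = stack.pop()
        if current.nodeType == current.TEXT_NODE:
            parts.append(current.data)
        else:
            # Push children in reverse so the leftmost child is popped first.
            stack.extend(reversed(current.childNodes))
    return "".join(parts)


# Sample document with nested elements, assumed for illustration only.
xmldoc = minidom.parseString("<a>Hello <b>there</b>, <c>world</c>!</a>")
all_text = get_all_text_iterative(xmldoc.documentElement)
print(all_text)
```

Building a list and joining it once avoids repeated string concatenation, which can matter for large subtrees.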
