java 如何使用 Jsoup 提取单独的文本节点？

Question

提问by M.M

I have an element like this :

我有一个这样的元素：

<td> TextA <br/> TextB </td>

How can I extract TextA and TextB separately?

如何分别提取 TextA 和 TextB？

Answer 1

回答by BalusC

Several ways. That really depends on the document itself and whether the given HTML markup is consistent or not. In this particular example you could get the td's child nodes by Element#childNodes()and then test every node individually if it's a TextNodeor not.

几种方式。这实际上取决于文档本身以及给定的 HTML 标记是否一致。在此特定示例中，您可以通过获取td的子节点Element#childNodes()，然后单独测试每个节点是否为 a TextNode。

E.g.

例如

Element td = getItSomehow();

for (Node child : td.childNodes()) {
    if (child instanceof TextNode) {
        System.out.println(((TextNode) child).text());
    }
}

which results in

这导致

 TextA 
 TextB

I think it would be nice if Jsoup offered a Element#textNodes()or something to get the child text nodes like as Element#children()does to get the child elements (which would have returned the <br />element in your example).

我认为如果 Jsoup 提供 aElement#textNodes()或其他东西来获取子文本节点，就像Element#children()获取子元素一样（这将<br />在您的示例中返回元素）会很好。

java 如何使用 Jsoup 提取单独的文本节点？

提问by M.M

回答by BalusC

相关推荐

最近更新

标签

java 如何使用 Jsoup 提取单独的文本节点？

提问by M.M

回答by BalusC

相关推荐

java 断言失败错误

java OpenCV 在模板匹配上的表现

java 从密钥库加载证书

Java 中的 get() 或 elementAt()

相关推荐

最近更新

标签