Python beautifulsoup：bs4.element.ResultSet 对象或列表上的 find_all？

Question

提问by YJZ

Hi so I apply find_all on a beautifulsoup object, and find something, which is an bs4.element.ResultSet objector a list.

嗨，所以我在 a 上应用 find_all beautifulsoup object，然后找到一些东西，它是 anbs4.element.ResultSet object或 a list。

I want to further do find_all in there, but it's not allowed on a bs4.element.ResultSet object. I can loop through each element of the bs4.element.ResultSet objectto do find_all. But can I avoid looping and just convert it back to a beautifulsoup object?

我想在那里进一步做 find_all ，但不允许在 bs4.element.ResultSet object. 我可以遍历每个元素bs4.element.ResultSet object来做 find_all。但是我可以避免循环并将其转换回 abeautifulsoup object吗？

See code for details please. Thanks

详情请看代码。谢谢

html_1 = """
<table>
    <thead>
        <tr class="myClass">
            <th>A</th>
            <th>B</th>
            <th>C</th>
            <th>D</th>
        </tr>
    </thead>
</table>
"""
soup = BeautifulSoup(html_1, 'html.parser')

type(soup) #bs4.BeautifulSoup

# do find_all on beautifulsoup object
th_all = soup.find_all('th')

# the result is of type bs4.element.ResultSet or similarly list
type(th_all) #bs4.element.ResultSet
type(th_all[0:1]) #list

# now I want to further do find_all
th_all.find_all(text='A') #not work

# can I avoid this need of loop?
for th in th_all:
    th.find_all(text='A') #works

Answer 1

回答by alecxe

ResultSetclass is a subclass of a listand not a Tagclasswhich has the find*methods defined. Looping through the results of find_all()is the most common approach:

ResultSetclass 是列表的子类，而不是定义了方法的Tag类find*。循环遍历结果find_all()是最常见的方法：

th_all = soup.find_all('th')
result = []
for th in th_all:
    result.extend(th.find_all(text='A'))

Usually, CSS selectorsmay help you solve it in one go except that not everything you can do with find_all()is possible with the select()method. For instance, there is no "text" search available in bs4CSS selectors. But, if, for example, you had to find all, say, belements inside thelements, you could do:

通常，CSS 选择器可以帮助您一次性解决问题，但并非您可以find_all()使用该select()方法完成所有操作。例如，bs4CSS 选择器中没有可用的“文本”搜索。但是，例如，如果您必须在b元素内找到所有th元素，您可以这样做：

soup.select("th td")

Python beautifulsoup：bs4.element.ResultSet 对象或列表上的 find_all？

提问by YJZ

回答by alecxe

相关推荐

最近更新

标签

Python beautifulsoup：bs4.element.ResultSet 对象或列表上的 find_all？

提问by YJZ

回答by alecxe

相关推荐

Python 在 Flask 应用程序中运行 Dash 应用程序

Python 如何使用正则表达式提取熊猫数据框中的特定内容？

Python 如何删除DataFrame中除某些列之外的所有列？

Python gitignore 中应该包含什么，如何将 env 文件夹放入 gitignore 中，我的文件夹结构是否正确？

相关推荐

最近更新

标签