Python TypeError: expected string or bytes-like object

Question

Asked by Vasanth Prabakar

I've written a script to parse HTML and print only the text content. I wanted to ignore the tags, but my program has a problem and I am not sure what it is. Please help me.

import urllib.request
import re
from bs4 import BeautifulSoup
url = "www.example.com"

def hi():
    dep = urllib.request.urlopen(url)
    soup = BeautifulSoup(dep, 'html.parser')
    for link in soup.find_all('p', string=True):
        result = re.sub(b'<.*?>', "", link)
        print (result)
hi()

The website link.

Answer 1

Answered by Nikolay Fominyh

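I believe the link variable holds a BeautifulSoup object (such as a NavigableString) rather than a plain str, and re.sub only accepts a string or bytes-like object.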

Cast it to a string explicitly, for example:

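for link in soup.find_all('p', string=True):
    # str() turns the BeautifulSoup object into plain markup text for re.sub;
    # the pattern must be a str too, since bytes and str cannot be mixed
    result = re.sub(r'<.*?>', "", str(link))
    print(result)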
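
To see which combinations of argument types re.sub accepts, here is a minimal sketch that uses only the standard library. FakeNode is a hypothetical stand-in for a BeautifulSoup node (it is not part of bs4), and try_sub is an illustrative helper, so treat this as a demonstration of the type rules rather than of the library itself.

```python
import re


class FakeNode:
    """Hypothetical stand-in for a BeautifulSoup node: not a str, but str() yields markup."""

    def __init__(self, markup):
        self.markup = markup

    def __str__(self):
        return self.markup


def try_sub(pattern, text):
    """Return the substituted text, or the TypeError message if re.sub rejects the arguments."""
    try:
        return re.sub(pattern, "", text)
    except TypeError as exc:
        return "TypeError: " + str(exc)


node = FakeNode("<p>Hello, <b>world</b>!</p>")

# 1. Passing the object itself fails: re.sub needs a str or bytes-like input.
print(try_sub(r'<.*?>', node))

# 2. Casting to str but using a bytes pattern also fails: the types cannot be mixed.
print(try_sub(b'<.*?>', str(node)))

# 3. A str pattern applied to a str input strips the tags.
print(try_sub(r'<.*?>', str(node)))
```

The first two calls report a TypeError and only the third returns the text without tags, which suggests that both the cast and a str pattern are needed for the substitution to work.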
