用于查找字符串中所有单词的 Python 正则表达式

Question

提问by TNT

Hello I am new into regex and I'm starting out with python. I'm stuck at extracting all words from an English sentence. So far I have:

您好，我是 regex 的新手，我从 python 开始。我被困在从一个英语句子中提取所有单词。到目前为止，我有：

import re

shop="hello seattle what have you got"
regex = r'(\w*) '
list1=re.findall(regex,shop)
print list1

This gives output:

这给出了输出：

['hello', 'seattle', 'what', 'have', 'you']

['你好'，'西雅图'，'什么'，'有'，'你']

If I replace regex by

如果我将正则表达式替换为

regex = r'(\w*)\W*'

then output:

然后输出：

['hello', 'seattle', 'what', 'have', 'you', 'got', '']

['你好'，'西雅图'，'什么'，'有'，'你'，'得到'，'']

whereas I want this output

而我想要这个输出

['hello', 'seattle', 'what', 'have', 'you', 'got']

['你好'，'西雅图'，'什么'，'有'，'你'，'有']

Please point me where I am going wrong.

请指出我哪里出错了。

Answer 1

回答by Pranav C Balan

Use word boundary \b

使用词边界 \b

import re

shop="hello seattle what have you got"
regex = r'\b\w+\b'
list1=re.findall(regex,shop)
print list1

OP : ['hello', 'seattle', 'what', 'have', 'you', 'got']

or simply \w+is enough

或者干脆\w+就够了

import re

shop="hello seattle what have you got"
regex = r'\w+'
list1=re.findall(regex,shop)
print list1

OP : ['hello', 'seattle', 'what', 'have', 'you', 'got']

用于查找字符串中所有单词的 Python 正则表达式

提问by TNT

回答by Pranav C Balan

相关推荐

最近更新

标签

用于查找字符串中所有单词的 Python 正则表达式

提问by TNT

回答by Pranav C Balan

相关推荐

Python 导入 pandas_datareader 给出 ImportError：无法导入名称 'is_list_like'

Python 如何使用 asyncio 定期执行函数？

Python f-strings 给出 SyntaxError？

Python 将 Pandas 数据帧转换为 Spark 数据帧错误

相关推荐

最近更新

标签