Python: Replace with regex

Question

Asked by Pickels

I need to replace part of a string. I was looking through the Python documentation and found re.sub.

import re
s = '<textarea id="Foo"></textarea>'
output = re.sub(r'<textarea.*>(.*)</textarea>', 'Bar', s)
print(output)
# prints: Bar

I was expecting this to print '<textarea id="Foo">Bar</textarea>' and not 'Bar'.

Could anybody tell me what I did wrong?

Answer 1

Accepted answer by Mark Byers

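Instead of capturing the part you want to replace, you can capture the parts you want to keep and then refer to them with backreferences such as \1 to include them in the substituted string. re.sub replaces the entire match, not just the captured group, which is why the original call returned only 'Bar'.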

Try this instead:

output = re.sub(r'(<textarea.*>).*(</textarea>)', r'\1Bar\2', s)
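
To see the difference side by side, here is a minimal sketch that runs the question's pattern and the capture-and-reinsert pattern on the same input. The variable names whole, kept and digits are only illustrative. The \g<1> form is the standard re syntax for a group reference that would otherwise be ambiguous when a digit follows it.

```python
import re

s = '<textarea id="Foo"></textarea>'

# re.sub replaces the whole match, so the surrounding tags are lost.
whole = re.sub(r'<textarea.*>(.*)</textarea>', 'Bar', s)

# Capture the tags and put them back with \1 and \2.
kept = re.sub(r'(<textarea.*>).*(</textarea>)', r'\1Bar\2', s)

# \g<1> avoids ambiguity when the replacement text starts with a digit:
# r'\142' would not be read as "group 1 followed by 42".
digits = re.sub(r'(<textarea.*>).*(</textarea>)', r'\g<1>42\g<2>', s)

print(whole)   # Bar
print(kept)    # <textarea id="Foo">Bar</textarea>
print(digits)  # <textarea id="Foo">42</textarea>
```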

Also, assuming this is HTML you should consider using an HTML parser for this task, for example Beautiful Soup.
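
As a rough illustration of the parser-based idea, the sketch below sets the element's text through a parsed tree instead of a regex. Beautiful Soup is a third-party package, so this example swaps in the standard library's xml.etree.ElementTree as a stand-in. That is an assumption on my part, not the answer's suggestion, and it only works when the snippet happens to be well-formed XML. Real-world HTML usually is not, which is where an actual HTML parser earns its keep.

```python
import xml.etree.ElementTree as ET

s = '<textarea id="Foo"></textarea>'

# Parse the snippet into an element tree (requires well-formed markup).
element = ET.fromstring(s)

# Set the element's text content; attributes are left untouched.
element.text = 'Bar'

# Serialize back to a string.
output = ET.tostring(element, encoding='unicode')
print(output)  # <textarea id="Foo">Bar</textarea>
```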

Answer 2

Answer by Rahul Agarwal

Or you could just use the search function instead:

match = re.search(r'(<textarea.*>).*(</textarea>)', s)
output = match.group(1) + 'bar' + match.group(2)
print(output)
# prints: <textarea id="Foo">bar</textarea>
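
One thing to watch with the search approach: re.search returns None when nothing matches, so calling .group() on the result would raise AttributeError. A minimal sketch of a guarded version follows. The helper name replace_textarea_body is hypothetical, and unlike the snippet above it also keeps any text before and after the match.

```python
import re

def replace_textarea_body(s, new_text):
    """Hypothetical helper: rebuild the string around the two captured tags."""
    match = re.search(r'(<textarea.*>).*(</textarea>)', s)
    if match is None:
        # No textarea found; return the input unchanged.
        return s
    # Keep text outside the match, then opening tag, new body, closing tag.
    return (s[:match.start()] + match.group(1) + new_text
            + match.group(2) + s[match.end():])

print(replace_textarea_body('<textarea id="Foo"></textarea>', 'bar'))
print(replace_textarea_body('no tags here', 'bar'))
```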
