Python 从字符串中删除非数字字符

Question

提问by Obcure

I have been given the task to remove all non numeric characters including spaces from a either text file or string and then print the new result next to the old characters for example:

我的任务是从文本文件或字符串中删除所有非数字字符，包括空格，然后在旧字符旁边打印新结果，例如：

Before:

前：

sd67637 8

After:

后：

sd67637 8 = 676378

As i am a beginner i do not know where to start with this task. Please Help

由于我是初学者，我不知道从哪里开始这项任务。请帮忙

Answer 1

采纳答案by mar mar

The easiest way is with a regexp

最简单的方法是使用正则表达式

import re
a = 'lkdfhisoe78347834 (())&/&745  '
result = re.sub('[^0-9]','', a)

print result
>>> '78347834745'

Answer 2

回答by Jon Clements

Loop over your string, char by char and only include digits:

循环遍历您的字符串，一个字符一个字符并且只包含数字：

new_string = ''.join(ch for ch in your_string if ch.isdigit())

Or use a regex on your string (if at some point you wanted to treat non-contiguous groups separately)...

或者在您的字符串上使用正则表达式（如果在某个时候您想单独处理非连续组）...

import re
s = 'sd67637 8' 
new_string = ''.join(re.findall(r'\d+', s))
# 676378

Then just printthem out:

然后print把它们拿出来：

print(old_string, '=', new_string)

Answer 3

回答by Saullo G. P. Castro

You can use string.ascii_lettersto identify your non-digits:

您可以使用string.ascii_letters来识别您的非数字：

from string import *

a = 'sd67637 8'
a = a.replace(' ', '')

for i in ascii_letters:
    a = a.replace(i, '')

In case you want to replace a colon, use quotes "instead of colons '.

如果您想替换冒号，请使用引号"而不是冒号'。

Answer 4

回答by Inbar Rose

There is a builtinfor this.

有一个内置的。

string.translate(s, table[, deletechars])
Delete all characters from s that are in deletechars (if present), and then translate the characters using table, which must be a 256-character string giving the translation for each character value, indexed by its ordinal. If table is None, then only the character deletion step is performed.

string.translate(s, table[, deletechars])
从 s 中删除 deletechars 中的所有字符（如果存在），然后使用 table 转换字符，它必须是一个 256 字符的字符串，给出每个字符值的转换，按其序数索引。如果 table 为 None，则仅执行字符删除步骤。

>>> import string
>>> non_numeric_chars = ''.join(set(string.printable) - set(string.digits))
>>> non_numeric_chars = string.printable[10:]  # more effective method. (choose one)
'sd67637 8'.translate(None, non_numeric_chars)
'676378'

Or you could do it with no imports (but there is no reason for this):

或者你可以在没有进口的情况下做到这一点（但没有理由这样做）：

>>> chars = 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ!"#$%&\'()*+,-./:;<=>?@[\]^_`{|}~ \t\n\r\x0b\x0c'
>>> 'sd67637 8'.translate(None, chars)
'676378'

Python 从字符串中删除非数字字符

提问by Obcure

采纳答案by mar mar

回答by Jon Clements

回答by Saullo G. P. Castro

回答by Inbar Rose

相关推荐

最近更新

标签

Python 从字符串中删除非数字字符

提问by Obcure

采纳答案by mar mar

回答by Jon Clements

回答by Saullo G. P. Castro

回答by Inbar Rose

相关推荐

使用python将数据从csv复制到postgresql

Python 类型错误：'dict_keys' 对象不支持索引

如何使用 Selenium WebDriver for python 在浏览器上打开一个新窗口？

使用 Turtle 图形的 Python 蛇游戏

相关推荐

最近更新

标签