bash: using sed to remove non-alphanumeric characters

Question

Asked by gorideyourbike

I am trying to validate some inputs by removing a set of characters. Only alphanumeric characters plus period, underscore, and hyphen are allowed. I've tested the regular expression [^\w.-] here: http://gskinner.com/RegExr/ and it matches what I want removed, so I'm not sure why sed is returning the opposite. What am I missing?

My end goal is to input "?10.41.89.50 " and get "10.41.89.50".

I've tried:

echo "?10.41.89.50 " | sed s/[^\w.-]//g returns ?...

echo "?10.41.89.50 " | sed s/[\w.-]//g and echo "?10.41.89.50 " | sed s/[\w^.-]//g both return ?10418950

I attempted the answer found in "Skip/remove non-ascii character with sed", but nothing was removed.

Answer 1

Answered by iruvar

tr's -c (complement) flag may be an option:

echo "?10.41.89.50-._ " | tr -cd '[:alnum:]._-'
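
If the value lives in a variable, the same tr call can be wrapped in a small function. This is only a sketch: the name sanitize_tr is made up for illustration, and LC_ALL=C is an assumption added here so that [:alnum:] covers only ASCII letters and digits.

```shell
#!/usr/bin/env bash
# Hypothetical helper: keep only letters, digits, '.', '_' and '-'.
sanitize_tr() {
  # -c complements the set and -d deletes, so everything NOT listed is removed.
  # The hyphen comes last so that tr treats it as a literal character.
  printf '%s' "$1" | LC_ALL=C tr -cd '[:alnum:]._-'
}

sanitize_tr '?10.41.89.50 '   # prints 10.41.89.50 (without a trailing newline)
```

Note that tr -cd also deletes newlines, because a newline is not in the allowed set.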

Answer 2

Answered by gniourf_gniourf

You might want to use the [:alnum:] class instead:

echo "?10.41.89.50 " | sed "s/[^[:alnum:]._-]//g"

should work. If not, you might need to change your locale settings.

On the other hand, if you only want to keep the digits, the hyphens and the periods:

echo "?10.41.89.50 " | sed "s/[^[:digit:].-]//g"
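
As a side note on why the original attempt misbehaves: inside a POSIX bracket expression a backslash is an ordinary character, so sed reads [^\w.-] as "anything except a backslash, w, period, or hyphen" rather than as a negated word class. The sketch below contrasts the two forms. The function names are made up for illustration, and the behavior shown is what I would expect from GNU and BSD sed.

```shell
#!/usr/bin/env bash
# Hypothetical helpers, for illustration only.

# \w inside [...] is a literal backslash plus a literal 'w', so digits are removed too.
strip_with_backslash_w() { printf '%s\n' "$1" | sed 's/[^\w.-]//g'; }

# A POSIX character class does work inside [...].
strip_with_class() { printf '%s\n' "$1" | sed 's/[^[:alnum:]._-]//g'; }

strip_with_backslash_w 'w?10.41.89.50 '   # prints w...
strip_with_class '?10.41.89.50 '          # prints 10.41.89.50
```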

If your string is in a variable, you can use pure bash and parameter expansions for that:

$ dirty="?10.41.89.50 "
$ clean=${dirty//[^[:digit:].-]/}
$ echo "$clean"
10.41.89.50

or

$ dirty="?10.41.89.50 "
$ clean=${dirty//[^[:alnum:]._-]/}
$ echo "$clean"
10.41.89.50
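
For repeated use, the parameter expansion can be put into a function. A minimal sketch, assuming bash (the ${var//pattern/} form is not POSIX sh) and using a hypothetical function name, sanitize:

```shell
#!/usr/bin/env bash
# Hypothetical helper: strip everything except letters, digits, '.', '_' and '-'.
sanitize() {
  local dirty=$1
  # ${var//pattern/} deletes every match of the glob pattern;
  # [^...] negates the set, and the trailing hyphen is literal.
  printf '%s\n' "${dirty//[^[:alnum:]._-]/}"
}

clean=$(sanitize '?10.41.89.50 ')
echo "$clean"   # prints 10.41.89.50
```

This avoids spawning sed or tr, which may matter when cleaning many values in a loop.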

You can also have a look at 1_CR's answer.

Answer 3

Answered by anubhava

Well, sed won't support Unicode characters. Use perl instead:

> s="?10.41.89.50 "
> perl -pe 's/[^\w.-]+//g' <<< "$s"
10.41.89.50

Answer 4

Answered by panticz

To remove all characters except alphanumerics and "-", use this code:

echo "a b-1_2" | sed "s/[^[:alnum:]-]//g"
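
Since the question is framed as input validation, it may sometimes be preferable to reject bad input instead of silently stripping it. The following is only a sketch of that idea, assuming bash's [[ =~ ]] operator; the function name is_clean is hypothetical and the allowed set is the one from the question.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed only if the input is non-empty and consists
# solely of letters, digits, '.', '_' and '-'.
is_clean() {
  [[ $1 =~ ^[[:alnum:]._-]+$ ]]
}

is_clean '10.41.89.50'   && echo "accepted"   # prints accepted
is_clean '?10.41.89.50 ' || echo "rejected"   # prints rejected
```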

Answer 5

Answered by technerdius

`[[:alnum:]_.@]`

This worked just fine for me. It preserved all of the characters I specified for my purposes.

Answer 6

Answered by Iwan Plays

Based on anubhava's answer, this one worked for me:

s/[^[:alnum:].]/ /g

It replaces anything other than alphanumeric characters with a single space.

Note: "." characters are preserved.
