bash 正确转义 sed 字符串

Question

提问by Chris Lieb

I have a regex and replacement pattern that have both been tested in Notepad++ on my input data and work correctly. When I put them into a sed expression, however, nothing gets matched.

我有一个正则表达式和替换模式，它们都在 Notepad++ 中对我的输入数据进行了测试并且正常工作。然而，当我将它们放入 sed 表达式时，没有任何匹配。

Here is the sed command:

这是 sed 命令：

 # SEARCH = ([a-zA-Z0-9.]+) [0-9] (.*)
 # REPLACE =  ()

 sed -e 's/\([a-zA-Z0-9.]+\) [0-9] \(.*\)/ \(\)/g'

Here is a sampling of the data:

以下是数据的样本：

jdoe 1 Doe, John
jad 1 Doe, Jane
smith 2 Smith, Jon

and the desired output:

和所需的输出：

Doe, John  (jdoe)
Doe, Jane  (jad)
Smith, Jon (smith)

I have tried removing and adding escapes to different characters in the sed expression, but either get nothing matched or something along the lines of:

我已经尝试在 sed 表达式中删除和添加不同字符的转义符，但要么没有匹配到任何内容，要么类似于：

sed: -e expression #1, char 42: invalid reference  on `s' command's RHS

How can I get this escaped correctly?

我怎样才能正确地转义这个？

Answer 1

回答by Mark Byers

I usually find it easier to use the -r switch as this means that escaping is similar to that of most other languages:

我通常发现使用 -r 开关更容易，因为这意味着转义类似于大多数其他语言：

sed -r 's/([a-zA-Z0-9.]+) [0-9] (.*)/ ()/g' file1.txt

Answer 2

回答by D.Shawley

A few warnings and additions to what everyone else has already said:

对其他人已经说过的一些警告和补充：

The -roption is a GNU extension to enable extended regular expressions. BSD derived sed's use -Einstead.
Sedand Grepuse Basic Regular Expressions
Awkuses Extended Regular Expressions
You should become comfortable with the POSIX specificationssuch as IEEE Std 1003.1if you want to write portable scripts, makefiles, etc.

该-r选项是一个 GNU 扩展，用于启用扩展的正则表达式。BSD 派生 sed 的使用-E代替。
Sed和Grep使用基本的正则表达式
awk使用扩展正则表达式
如果您想编写可移植的脚本、makefile 等，您应该熟悉POSIX 规范，例如IEEE Std 1003.1。

I would recommend rewriting the expression as

我建议将表达式重写为

's/\([a-zA-Z0-9.]\{1,\}\) [0-9] \(.*\)/ ()/g'

which should do exactly what you want in any POSIX compliant sed. If you do indeed care about such things, consider defining the POSIXLY_CORRECTenvironment variable.

在任何符合 POSIX 的sed. 如果您确实关心这些事情，请考虑定义POSIXLY_CORRECT环境变量。

Answer 3

回答by Paused until further notice.

The plus sign needs to be escaped when not using the -rswitch.

不使用-r开关时，加号需要转义。

Answer 4

回答by fwaechter

Using awk is much simpler...:

使用 awk 要简单得多...：

cat test.txt | awk '{ print  " "  " " "("")" }'

Output:

输出：

Doe, John (jdoe)
Doe, Jane (jad)
Smith, Jon (smith)

See man awk 1

参见 man awk 1

Answer 5

回答by ghostdog74

$ sed -e 's/\([a-zA-Z0-9.].*\) [0-9] \(.*\)/ \(\)/g' file
Doe, John (jdoe)
Doe, Jane (jad)
Smith, Jon (smith)

bash 正确转义 sed 字符串

提问by Chris Lieb

回答by Mark Byers

回答by D.Shawley

回答by Paused until further notice.

回答by fwaechter

回答by ghostdog74

相关推荐

最近更新

标签

bash 正确转义 sed 字符串

提问by Chris Lieb

回答by Mark Byers

回答by D.Shawley

回答by Paused until further notice.

回答by fwaechter

回答by ghostdog74

相关推荐

bash 在不排序的情况下删除变量上的重复项

bash “$$”在shell脚本中是什么意思？

bash shell编程中如何从键盘输入数据

Mac 上的 Bash 脚本创建信息弹出窗口

相关推荐

最近更新

标签