bash SED 删除带有 REGEX 模式的行

Question

提问by Imkls

i've got a hundreds of files with thousands of lines, which i need to delete some lines that follows a pattern,so i went to SED with regex .The struct of files is something like this

我有数百个包含数千行的文件，我需要删除一些遵循模式的行，所以我用正则表达式去了 SED。文件的结构是这样的

A,12121212121212,foo,bar,lorem
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
A,21212121212121,foo,bar,lorem
C,32JL,JL
C,32JL,JL
C,32JL,JL
A,9999,88888,77777

I need to delete All the lines that starts with "A" and ends with "lorem"

我需要删除所有以“ A”开头并以“ lorem”结尾的行

Expected output-

预期输出-

C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
A,9999,88888,77777

I've made the Regex :

我已经制作了正则表达式：

^(A).*(lorem)

And it match in my text editor (Sublime,UltraEdit)

它在我的文本编辑器中匹配（Sublime，UltraEdit）

In the UNIX shell

在 UNIX 外壳中

sed '/^(A).*(lorem)/d' file.txt

But somehow it doesn't work,it shows the whole file, and i can't figure out why.

但不知何故它不起作用，它显示了整个文件，我不知道为什么。

Can someone help me please?

有人能帮助我吗？

Answer 1

回答by Aaron

The others gave you correct solutions but didn't explain why your regex didn't work. The ()surely were useless, but if you had used the regex with other tools/languages, you might very well have had the expected result.

其他人为您提供了正确的解决方案，但没有解释为什么您的正则表达式不起作用。该()肯定是无用的，但如果你已经使用其他工具/语言的正则表达式，你很可能会不得不预期的结果。

It didn't work with sedbecause it will by default use POSIX's basic regular expressions, where the characters for grouping are $and $, while (and )will match literal characters. There were no such brackets in your input text, so it didn't match.

它不起作用，sed因为它默认使用POSIX 的基本正则表达式，其中用于分组的字符是$and $，而(和)将匹配文字字符。您的输入文本中没有这样的括号，因此不匹配。

Your regular expression would have worked if you had used GNU's sed -ror BSD's sed -E, the flag switching to POSIX's extended regular expressions where (and )are used to group and match the literal brackets.

如果您使用了 GNUsed -r或 BSD 的正则表达式，您的正则表达式会起作用sed -E，该标志切换到 POSIX 的扩展正则表达式，其中(和)用于分组和匹配文字括号。

In conclusion, the following commands will do the same thing :

总之，以下命令将执行相同的操作：

sed '/^A.*lorem$/d' file.txt
sed -r '/^(A).*(lorem)$/d' file.txt(with GNU sed)
sed -E '/^(A).*(lorem)$/d' file.txt(with BSD sed and modern GNU sed)
sed '/^$A$.*$lorem$$/d' file.txt

sed '/^A.*lorem$/d' file.txt
sed -r '/^(A).*(lorem)$/d' file.txt（使用 GNU sed）
sed -E '/^(A).*(lorem)$/d' file.txt（使用 BSD sed 和现代 GNU sed）
sed '/^$A$.*$lorem$$/d' file.txt

Answer 2

回答by James Brown

$ sed '/^A.*lorem$/d' file.txt

^A: starts with an A
.*: stuff in the middle
lorem$: ends with lorem

^A: 开头 A
.*: 中间的东西
lorem$：以。。结束 lorem

Answer 3

回答by Chem-man17

Remove the brackets.

取下括号。

Using your code, the appropriate one-liner becomes-

使用您的代码，适当的单行代码变为-

sed '/^A.*lorem/d' file.txt

If you want to be more rigourous, you can look at James's answer which more correctly terminates the regex as-

如果你想更严格，你可以看看詹姆斯的回答，它更正确地终止了正则表达式——

sed '/^A.*lorem$/d' file.txt

Both will work.

两者都会起作用。

bash SED 删除带有 REGEX 模式的行

提问by Imkls

回答by Aaron

回答by James Brown

回答by Chem-man17

相关推荐

最近更新

标签

bash SED 删除带有 REGEX 模式的行

提问by Imkls

回答by Aaron

回答by James Brown

回答by Chem-man17

相关推荐

bash 禁用 Psql 输出中的换行

bash 成功等待脚本后如何启动docker容器

Bash：非法变量名错误

bash linux服务器上保存命令历史的所有位置在哪里

相关推荐

最近更新

标签