bash 如何在unix bash中删除文本文件中的重复行？

Question

提问by t28292

I just have a file.txt with multiple lines, I would like to remove duplicate lines without sorting the file. what command can i use in unix bash ?

我只有一个多行的 file.txt，我想删除重复的行而不对文件进行排序。我可以在 unix bash 中使用什么命令？

sample of file.txt

文件示例.txt

orangejuice;orange;juice_apple
pineapplejuice;pineapple;juice_pineapple
orangejuice;orange;juice_apple

sample of output:

输出样本：

orangejuice;orange;juice_apple
pineapplejuice;pineapple;juice_pineapple

Answer 1

回答by Steve

One way using awk:

一种使用方式awk：

awk '!a[perl -ne 'print unless $seen{$_}++' file.txt
]++' file.txt

Answer 2

回答by choroba

You can use Perl for this:

您可以为此使用 Perl：

##代码##

The -nswitch makes Perl process the file line by line. Each line ($_) is stored as a key in a hash named "seen", but since ++happens after returning the value, the line is printed the first time it is met.

该-n开关使 Perl 逐行处理文件。每行 ( $_) 作为键存储在名为“seen”的散列中，但由于++在返回值后发生，因此在第一次遇到时打印该行。

bash 如何在unix bash中删除文本文件中的重复行？

提问by t28292

回答by Steve

回答by choroba

相关推荐

最近更新

标签

bash 如何在unix bash中删除文本文件中的重复行？

提问by t28292

回答by Steve

回答by choroba

相关推荐

bash 解决 $1: 歧义重定向

bash “-ne”在bash中是什么意思？

bash 创建后立即断开符号链接

bash for 循环：一系列数字

相关推荐

最近更新

标签