Linux: Fastest way of finding differences between two files in Unix?
Disclaimer: this page is a translation of a popular StackOverflow question and is provided under the CC BY-SA 4.0 license. If you reuse it, you must follow the same CC BY-SA terms, link to the original, and attribute it to the original authors (not me): StackOverflow
原文地址: http://stackoverflow.com/questions/18069611/
Fastest way of finding differences between two files in unix?
Asked by Steam
I want to find the difference between two files and then put only the differences in a third file. I have seen different approaches using awk, diff and comm. Are there any others?
e.g. Compare two files line by line and generate the difference in another file
e.g. Copy differences between two files in unix
I need to know the fastest way of finding all the differences and listing them in a file, for each of the cases below:
Case 1 - file2 = file1 + extra text appended.
Case 2 - file2 and file1 are different.
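For concreteness, here is a minimal test setup (hypothetical names and contents) that the commands below can be run against:

printf 'apple\nbanana\ncherry\n' > file1
# Case 1 shape: file2 is file1 plus extra lines appended
printf 'apple\nbanana\ncherry\ndate\nelderberry\n' > file2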
Accepted answer by danmc
You could try:
comm -13 <(sort file1) <(sort file2) > file3
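Here -1 suppresses comm's first column (lines unique to file1) and -3 its third column (lines common to both), so only lines unique to file2 remain; comm requires sorted input, hence the process substitutions. To get the lines unique to file1 instead:

comm -23 <(sort file1) <(sort file2) > file3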
or
grep -Fxvf file1 file2 > file3
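Here -F treats the patterns as fixed strings, -x matches whole lines only, -v inverts the match, and -f file1 reads the patterns from file1, so this prints the lines of file2 that do not appear in file1. A sketch for the other direction (note that with a very large pattern file this approach can be slow and memory-hungry):

grep -Fxvf file2 file1 > file3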
or
diff file1 file2 | grep '^<' | sed 's/^< //' > file3
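diff prefixes lines unique to the first file with "<" and lines unique to the second with ">", so the lines added in file2 can be pulled out the same way (a sketch):

diff file1 file2 | sed -n 's/^> //p' > file3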
or
join -v 2 <(sort file1) <(sort file2) > file3
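One caveat: by default join compares only the first whitespace-delimited field, so this matches comm -13 only when each line is a single field. To join on whole lines, one option is a separator byte that is assumed never to occur in the data:

join -v 2 -t $'\x01' <(sort file1) <(sort file2) > file3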
Answer by P_M
You could also compute MD5 hash sums (or similar) first to determine whether there are any differences at all, then only compare files whose hashes differ.
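A minimal sketch of that idea, assuming GNU coreutils' md5sum and the same placeholder file names as above:

if [ "$(md5sum < file1)" = "$(md5sum < file2)" ]; then
    echo 'files are identical, skipping the diff'
else
    diff file1 file2 > file3
fi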
Answer by pron
Another option:
sort file1 file2 | uniq -u > file3
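Note that this prints the symmetric difference, and only behaves correctly if no line repeats within a single file: a line that appears twice in one file but not at all in the other is dropped by uniq -u, so that difference is silently missed. A hedged workaround is to deduplicate each file first:

sort -u file1 > file1.sorted
sort -u file2 > file2.sorted
sort file1.sorted file2.sorted | uniq -u > file3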
If you want to see just the duplicate entries, use the "uniq -d" option:
sort file1 file2 | uniq -d > file3
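With the sample files from the top of the page, this prints apple, banana and cherry, the lines common to both files (again assuming no line repeats within a single file):

sort file1 file2 | uniq -d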
Answer by James Bond 86
This will work fast:
Case 1 - File2 = File1 + extra text appended.
grep -Fxvf File1.txt File2.txt >> File3.txt
File 1: 80 lines; File 2: 100 lines; File 3: 20 lines.
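For Case 1 specifically, where file2 is known to be file1 with text appended at the end, the extra lines are simply everything past file1's line count. A sketch, valid only under that append-only assumption:

# print file2 starting at the line just past the end of file1
tail -n +"$(( $(wc -l < file1) + 1 ))" file2 > file3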