The equal-files package

[Tags: gpl, program]

A shell command that finds files with equal content in a given set of files. This is useful for finding duplicates in a set of documents where the same document may have been stored accidentally under different names. Use it like

 equal-files `find my_dir -type f`

or

 find my_dir -type f | xargs equal-files

If the file names contain spaces, it is better to use

 find my_dir -type f -printf "'%p'\n" | xargs equal-files

or

 equal_files -r my_dir
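The quoting trick above still fails for names that themselves contain a single quote or a newline. A minimal sketch of a more robust pipeline follows, assuming a `find` and `xargs` that support NUL-separated names (`-print0` and `-0`, as in the GNU and BSD tools). The scratch directory, the file names, and the `count_args` helper are hypothetical and stand in for the real program so that the example runs without equal-files installed.

```shell
# Create a scratch directory with awkward file names (hypothetical demo data).
demo_dir=$(mktemp -d)
printf 'same\n' > "$demo_dir/report final.txt"
printf 'same\n' > "$demo_dir/it's a copy.txt"

# Stand-in for equal-files: reports how many arguments arrive at the command.
# Each file name is passed NUL-terminated, so spaces and quotes stay intact.
count_args() {
  find "$1" -type f -print0 | xargs -0 sh -c 'echo "$#"' sh
}

arg_count=$(count_args "$demo_dir")
echo "arguments received: $arg_count"

# Clean up the scratch directory.
rm -rf "$demo_dir"

# With the real program the pipeline would presumably read:
#   find my_dir -type f -print0 | xargs -0 equal-files
```

Each of the two files arrives as exactly one argument, regardless of the spaces and the apostrophe in their names.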

The program reads all input files simultaneously, driven by sorting and grouping of their content. Be prepared, however, that this simultaneous access may exceed the permitted number of open files. You may therefore prefer to run it like

 equal_files -r -p 512 my_dir
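To judge in advance whether the limit is likely to matter, one can compare the number of input files with the shell's limit on open file descriptors. The following is only a rough sketch: `may_exceed_fd_limit` is a hypothetical helper, the scratch directory is demo data, and the comparison assumes roughly one descriptor per input file, which is an assumption about the program rather than something documented here.

```shell
# Soft limit on open file descriptors for this shell
# (some systems report "unlimited").
fd_limit=$(ulimit -n)

# Hypothetical helper: succeeds if the number of regular files under $1
# reaches the descriptor limit. File names containing newlines would be
# miscounted; this is acceptable for a rough estimate.
may_exceed_fd_limit() {
  file_count=$(find "$1" -type f | wc -l | tr -d ' ')
  [ "$fd_limit" != "unlimited" ] && [ "$file_count" -ge "$fd_limit" ]
}

# Demo on a scratch directory with two files.
demo_dir=$(mktemp -d)
touch "$demo_dir/a" "$demo_dir/b"
if may_exceed_fd_limit "$demo_dir"; then
  echo "many files: consider the -p option"
else
  echo "file count is below the descriptor limit"
fi
rm -rf "$demo_dir"
```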

The program serves as a nice example of a declarative yet efficient implementation of a non-trivial algorithm, made possible by lazy evaluation.


Versions: 0.0.1, 0.0.2, 0.0.3, 0.0.4, 0.0.5
Change log: None available
Dependencies: base (>=3 && <5), bytestring (>=0.9 && <0.11), explicit-exception (==0.1.*), filemanip (>=0.3.5 && <0.4), transformers (>=0.2 && <0.5), utility-ht (>=0.0.1 && <0.1)
Author: Henning Thielemann
Maintainer: Henning Thielemann
Source repository: this: darcs get --tag
head: darcs get
Uploaded: Sat Jul 25 17:59:01 UTC 2015 by HenningThielemann
Downloads: 1130 total (61 in last 30 days)
Status: Docs not available
Last success reported on 2015-09-28

