The equal-files package

[ Tags: console, gpl, program ]

A shell command that finds files with identical content within a given set of files. It is useful for finding duplicates in a collection of documents where the same document may have been stored accidentally under different names. Use it like this:

equal-files `find my_dir -type f`

or

find my_dir -type f | xargs equal-files

If the file names contain spaces, it is better to use

find my_dir -type f -printf "'%p'\n" | xargs equal-files
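
Wrapping names in single quotes still fails for a name that itself contains a single quote. A more robust variant separates the names with NUL bytes. The sketch below assumes that your find and xargs support the -print0 and -0 options (the GNU and BSD versions do). In the demonstration, printf is only a stand-in for equal-files so that the argument boundaries become visible, and demo_dir and arg_count are hypothetical names.

```shell
# Scratch directory with awkward names (a space and a single quote).
demo_dir=$(mktemp -d)
printf 'same\n' > "$demo_dir/it's a copy.txt"
printf 'same\n' > "$demo_dir/original.txt"

# NUL-delimited hand-off: every path arrives as exactly one argument.
# printf stands in for equal-files and prints each argument in brackets.
find "$demo_dir" -type f -print0 | xargs -0 printf '[%s]\n'

# Count the arguments the stand-in received (one output line per argument).
arg_count=$(find "$demo_dir" -type f -print0 | xargs -0 printf '[%s]\n' | wc -l | tr -d ' ')

# Remove the scratch directory again.
rm -rf "$demo_dir"

# Real invocation, once equal-files is installed:
#   find my_dir -type f -print0 | xargs -0 equal-files
```

With this form no quoting of the names is needed at all, because NUL is the one byte that cannot occur in a path.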

The program reads all input files simultaneously, driven by sorting and grouping of their content. Be aware that, because all files are open at the same time, you may exceed the maximum number of open files that the system permits.
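
Before running the program on a large directory, it can help to compare the number of files with the shell's open-file limit. The following is a rough sketch, not part of equal-files: it assumes a shell whose ulimit builtin accepts -n (bash, dash, ksh and zsh do), and the names target_dir, fd_limit and file_count are illustrative. The real threshold is somewhat lower than the reported limit, since the process also needs descriptors for standard input, output and error.

```shell
# Directory to scan; a scratch directory with three files is used for demonstration.
target_dir=$(mktemp -d)
for name in a b c; do printf 'x\n' > "$target_dir/$name"; done

# Soft limit on simultaneously open file descriptors in this shell.
fd_limit=$(ulimit -n)

# Number of regular files that would be handed to equal-files
# (line-based count, so it assumes no newlines in file names).
file_count=$(find "$target_dir" -type f | wc -l | tr -d ' ')

if [ "$fd_limit" != "unlimited" ] && [ "$file_count" -ge "$fd_limit" ]; then
  echo "warning: $file_count files, but only $fd_limit descriptors allowed" >&2
else
  echo "ok: $file_count files, limit is $fd_limit"
fi

# Remove the scratch directory again.
rm -rf "$target_dir"
```

If the warning appears, the soft limit can often be raised for the current shell with ulimit -n, up to the hard limit. Splitting the input into smaller sets also avoids the limit, but duplicates that span two sets will then go unnoticed.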

The program also serves as an example of a declarative yet efficient implementation of a non-trivial algorithm, made possible by lazy evaluation.

Properties

Versions: 0.0.1, 0.0.2, 0.0.2.1, 0.0.3, 0.0.4, 0.0.5, 0.0.5.1, 0.0.5.2, 0.0.5.3
Dependencies: base (>=1.0 && <2 || >=3), bytestring (==0.9.*)
License: GPL
Author: Henning Thielemann <haskell@henning-thielemann.de>
Maintainer: Henning Thielemann <haskell@henning-thielemann.de>
Category: Console
Home page: http://code.haskell.org/~thielema/equal-files
Uploaded: Tue Dec 16 23:57:55 UTC 2008 by HenningThielemann
Distributions: LTSHaskell:0.0.5.3, NixOS:0.0.5.3, Stackage:0.0.5.3, Tumbleweed:0.0.5.3
Executables: equal-files
Downloads: 2148 total (273 in the last 30 days)
Rating: no votes yet
Status: Docs not available
Last success reported on 2017-01-02

Flags

Name       Description                                     Default  Type
splitbase  Choose the new smaller, split-up base package.  Enabled  Automatic

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag.
