The archiver package

[Tags: bsd3, library, program]

archiver is a daemon which will process a specified text file, each line of which is a URL, and will (randomly) one by one request that the URLs be archived or spidered by http://www.webcitation.org and http://www.archive.org for future reference. (One may optionally specify an arbitrary sh command like wget --page-requisites to download URLs locally.)

Because the interface is a simple text file, this can be combined with other scripts; for example, a script using Sqlite to extract visited URLs from Firefox, or a program extracting URLs from Pandoc documents. (See http://www.gwern.net/Archiving%20URLs.)

For explanation of the derivation of the code in Network.URL.Archiver, see http://www.gwern.net/haskell/Wikipedia%20Archive%20Bot.


Properties

Versions0.1, 0.2, 0.3, 0.3.1, 0.4, 0.5, 0.5.1, 0.6.0, 0.6.1, 0.6.2, 0.6.2.1
Dependenciesbase (==4.*), bytestring, containers, curl, HTTP, network, process, random
LicenseBSD3
AuthorGwern
MaintainerGwern <gwern0@gmail.com>
Stabilityprovisional
CategoryDocumentation, Network
Source repositoryhead: darcs get http://community.haskell.org/~gwern/archiver/
Executablesarchiver
Upload dateTue Jun 21 00:51:30 UTC 2011
Uploaded byGwernBranwen
Downloads800 total (61 in last 30 days)

Modules

[Index]

Downloads

Maintainers' corner

For package maintainers and hackage trustees