archiver: Archive supplied URLs in WebCite & Internet Archive

[ bsd3, documentation, library, network, program ] [ Propose Tags ]

archiver is a daemon which will process a specified text file, each line of which is a URL, and will (randomly) one by one request that the URLs be archived or spidered by http://www.webcitation.org and http://www.archive.org for future reference.

Because the interface is a simple text file, this can be combined with other scripts; for example, a script using Sqlite to extract visited URLs from Firefox, or a program extracting URLs from Pandoc documents. (See http://www.gwern.net/Archiving%20URLs.html.)

For explanation of the derivation of the code in Network.URL.Archiver, see http://www.gwern.net/haskell/Wikipedia%20Archive%20Bot.html.

Modules

[Index]

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

  • No Candidates
Versions [RSS] 0.1, 0.2, 0.3, 0.3.1, 0.4, 0.5, 0.5.1, 0.6.0, 0.6.1, 0.6.2, 0.6.2.1
Dependencies base (>=4 && <5), bytestring, containers, curl, HTTP, network, random [details]
License BSD-3-Clause
Author Gwern
Maintainer Gwern <gwern0@gmail.com>
Category Documentation, Network
Source repo head: darcs get http://community.haskell.org/~gwern/archiver/
Uploaded by GwernBranwen at 2011-05-17T16:06:27Z
Distributions
Reverse Dependencies 1 direct, 0 indirect [details]
Executables archiver
Downloads 9065 total (26 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs uploaded by user
Build status unknown [no reports yet]