archiver: Archive supplied URLs in WebCite & Internet Archive
archiver is a daemon which will process a specified text file, each line of which is a URL, and will (randomly) one by one request that the URLs be archived or spidered by http://www.webcitation.org and http://www.archive.org for future reference.
Because the interface is a simple text file, this can be combined with other scripts; for example, a script using Sqlite to extract visited URLs from Firefox, or a program extracting URLs from Pandoc documents. (See http://www.gwern.net/Archiving%20URLs.html.)
For explanation of the derivation of the code in Network.URL.Archiver
,
see http://www.gwern.net/haskell/Wikipedia%20Archive%20Bot.html.
Downloads
- archiver-0.3.1.tar.gz [browse] (Cabal source package)
- Package description (as included in the package)
Maintainer's Corner
For package maintainers and hackage trustees
Candidates
- No Candidates
Versions [RSS] | 0.1, 0.2, 0.3, 0.3.1, 0.4, 0.5, 0.5.1, 0.6.0, 0.6.1, 0.6.2, 0.6.2.1 |
---|---|
Dependencies | base (>=4 && <5), bytestring, containers, curl, HTTP, network, random [details] |
Tested with | ghc ==6.12.1 |
License | BSD-3-Clause |
Author | Gwern |
Maintainer | Gwern <gwern0@gmail.com> |
Category | Documentation, Network |
Source repo | head: darcs get http://community.haskell.org/~gwern/archiver/ |
Uploaded | by GwernBranwen at 2011-05-17T16:06:27Z |
Distributions | |
Reverse Dependencies | 1 direct, 0 indirect [details] |
Executables | archiver |
Downloads | 9229 total (30 in the last 30 days) |
Rating | (no votes yet) [estimated by Bayesian average] |
Your Rating | |
Status | Docs uploaded by user Build status unknown [no reports yet] |