The wp-archivebot package

[ Tags: bsd3, network, program ] [ Propose Tags ]

A MediaWiki's RecentChanges or NewPages links to every new edit or article; this bot will poll the corresponding RSS feeds (easier and more reliable than parsing the HTML), follow the links to the new edit/article, and then use TagSoup to filter out every off-wiki link (eg. to

With this list of external links, the bot will then fire off requests to, which will make a backup (similar to the Internet Archive, but on-demand).

Example: to archive links from every article in the English Wikipedia's RecentChanges:

wp-archivebot ''


Versions 0.1
Dependencies base (==3.*), feed, HTTP, network, parallel, tagsoup [details]
License BSD3
Author Gwern
Category Network
Uploaded Thu Jun 4 16:31:50 UTC 2009 by GwernBranwen
Distributions NixOS:0.1
Executables wp-archivebot
Downloads 463 total (7 in the last 30 days)
Rating (no votes yet) [estimated by rule of succession]
Your Rating
  • λ
  • λ
  • λ
Status Docs not available [build log]
All reported builds failed as of 2017-01-01 [all 7 reports]
Hackage Matrix CI


Maintainer's Corner

For package maintainers and hackage trustees