archiver-0.3.1: Archive supplied URLs in WebCite & Internet Archive

Network.URL.Archiver

Synopsis

Documentation

checkArchiveSource

Arguments

:: String

email for WebCite to send status to

-> String

URL to archive

-> IO () 

Error check the URL and then archive it using webciteArchive and alexaArchive

webciteArchive :: String -> String -> IO ()Source

Request http://www.webcitation.org to copy a supplied URL; WebCite does on-demand archiving, unlike Alexa/Internet Archive, and so in practice this is the most useful function. This function throws away any return status from WebCite (which may be changed in the future), so it is suggested that one test with a valid email address.

Warning! WebCite has throttling mechanisms; if you request more than 100 URLs per hour, your IP may be banned! It is suggested that one sleep for ~30 seconds between each URL request.

alexaArchive :: String -> IO ()Source

Request http://www.alexa.com to spider a supplied URL. Alexa supplies the Internet Archive's caches. TODO: currently broken? Alexa changed pages? is down?