shpider: Web automation library in Haskell.
Shpider is a web automation library for Haskell. It allows you to quickly write crawlers, and for simple cases ( like following links ) even without reading the page source.
It has useful features such as turning relative links from a page into absolute links, options to authorize transactions only on a given domain, and the option to only download html documents.
It also provides a nice syntax for filling out forms.
An example:
runShpider $ do download "http://apage.com" theForm : _ <- getFormsByAction "http://anotherpage.com" sendForm $ fillOutForm theForm $ pairs $ do "occupation" =: "unemployed Haskell programmer" "location" =: "mother's house"
Shpider contains a patched version of the curl package, to fix cookie handling. The curl licence is therefore distributed with this package.
Modules
[Index]
Downloads
- shpider-0.0.2.tar.gz [browse] (Cabal source package)
- Package description (as included in the package)
Maintainer's Corner
For package maintainers and hackage trustees
Candidates
- No Candidates
Versions [RSS] | 0.0.1, 0.0.2, 0.0.3, 0.0.4, 0.0.5, 0.0.6, 0.0.7, 0.1.0, 0.1.1, 0.2, 0.2.1.1 |
---|---|
Dependencies | base, bytestring, containers, mtl, regex-posix, tagsoup, tagsoup-parsec, url [details] |
License | BSD-3-Clause |
Author | Johnny Morrice |
Maintainer | Johnny Morrice <spoon@killersmurf.com> |
Category | Web |
Home page | http://www.killersmurf.com/projects/shpider |
Uploaded | by JohnnyMorrice at 2009-03-20T18:21:59Z |
Distributions | |
Reverse Dependencies | 1 direct, 0 indirect [details] |
Downloads | 8070 total (41 in the last 30 days) |
Rating | (no votes yet) [estimated by Bayesian average] |
Your Rating | |
Status | Docs uploaded by user Build status unknown [no reports yet] |