shpider: Web automation library in Haskell.

[ bsd3, library, web ] [ Propose Tags ] [ Report a vulnerability ]

Shpider is a web automation library for Haskell. It allows you to quickly write crawlers, and for simple cases ( like following links ) even without reading the page source.

It has useful features such as turning relative links from a page into absolute links, options to authorize transactions only on a given domain, and the option to only download html documents.

It also provides a nice syntax for filling out forms.

An example:

runShpider $ do
     download "http://apage.com"
     theForm : _ <- getFormsByAction "http://anotherpage.com"
     sendForm $ fillOutForm theForm $ pairs $ do
           "occupation" =: "unemployed Haskell programmer"
           "location" =: "mother's house"

Shpider contains a patched version of the curl package, to fix cookie handling. The curl licence is therefore distributed with this package.

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

  • No Candidates
Versions [RSS] 0.0.1, 0.0.2, 0.0.3, 0.0.4, 0.0.5, 0.0.6, 0.0.7, 0.1.0, 0.1.1, 0.2, 0.2.1.1
Dependencies base, bytestring, containers, mtl, regex-posix, tagsoup, tagsoup-parsec, url [details]
License BSD-3-Clause
Author Johnny Morrice
Maintainer Johnny Morrice <spoon@killersmurf.com>
Category Web
Home page http://www.killersmurf.com/projects/shpider
Uploaded by JohnnyMorrice at 2009-03-20T18:21:59Z
Distributions
Reverse Dependencies 1 direct, 0 indirect [details]
Downloads 8070 total (41 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs uploaded by user
Build status unknown [no reports yet]