The tagsoup package
TagSoup is a library for parsing HTML and XML. It supports the HTML 5 specification and can parse either well-formed XML or unstructured, malformed HTML from the web. It also provides functions for extracting information from an HTML document, which makes it well suited to screen-scraping.
Users should start from the Text.HTML.TagSoup module.
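As a rough illustration of the screen-scraping use case, the sketch below parses a small, deliberately malformed HTML fragment with `parseTags` and collects the `href` of every `a` tag. The helper name `linkTargets` and the sample input are assumptions made up for this example; only `parseTags` and the `TagOpen` constructor come from Text.HTML.TagSoup.

```haskell
import Text.HTML.TagSoup (Tag (TagOpen), parseTags)

-- Hypothetical helper (not part of tagsoup): collect the href attribute
-- of every <a> tag, in document order. Anchors without an href are skipped.
linkTargets :: String -> [String]
linkTargets html =
  [ href
  | TagOpen "a" attrs <- parseTags html      -- keep only opening <a> tags
  , Just href <- [lookup "href" attrs]       -- attributes are (name, value) pairs
  ]

main :: IO ()
main =
  -- The <p> is never closed; parseTags still yields a flat list of tags.
  mapM_ putStrLn (linkTargets "<p>See <a href=\"/docs\">docs</a> and <a href=\"/faq\">FAQ</a>")
```

Because `parseTags` returns a flat list of tags rather than a tree, an ordinary list comprehension is enough for simple extraction like this. Run on the sample input, the program should print `/docs` and `/faq` on separate lines.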
Properties
| Property | Value |
|---|---|
| Versions | 0.1, 0.4, 0.6, 0.8, 0.9, 0.10, 0.10.1, 0.11, 0.11.1, 0.12, 0.12.1, 0.12.2, 0.12.3, 0.12.4, 0.12.5, 0.12.6 |
| Dependencies | base (4.*), bytestring, containers, text |
| License | BSD3 |
| Copyright | Neil Mitchell 2006-2011 |
| Author | Neil Mitchell <ndmitchell@gmail.com> |
| Maintainer | Neil Mitchell <ndmitchell@gmail.com> |
| Category | XML |
| Home page | http://community.haskell.org/~ndm/tagsoup/ |
| Source repository | darcs get http://community.haskell.org/~ndm/darcs/tagsoup/ |
| Executables | tagsoup |
| Upload date | Sun Dec 11 15:21:52 UTC 2011 |
| Uploaded by | NeilMitchell |
| Built on | ghc-7.4 |
| Distributions | Fedora: 0.12.6, NixOS: 0.12.6, Debian: 0.12.6, FreeBSD: 0.12.6 |
Downloads
- tagsoup-0.12.6.tar.gz (Cabal source package)
- package description (included in the package)