tagsoup-0.12.8: Parsing and extracting information from (possibly malformed) HTML/XML documents

The tagsoup package

TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.

Properties

Versions0.1, 0.4, 0.6, 0.8, 0.9, 0.10, 0.10.1, 0.11, 0.11.1, 0.12, 0.12.1, 0.12.2, 0.12.3, 0.12.4, 0.12.5, 0.12.6, 0.12.7, 0.12.8
Dependenciesbase (4.*), bytestring, containers, text
LicenseBSD3
CopyrightNeil Mitchell 2006-2012
AuthorNeil Mitchell <ndmitchell@gmail.com>
MaintainerNeil Mitchell <ndmitchell@gmail.com>
CategoryXML
Home pagehttp://community.haskell.org/~ndm/tagsoup/
Source repositorydarcs get http://community.haskell.org/~ndm/darcs/tagsoup/
Executablestagsoup
Upload dateMon Sep 17 19:05:27 UTC 2012
Uploaded byNeilMitchell
Built onghc-7.6
DistributionsFedora: 0.12.8, NixOS: 0.12.8, Debian: 0.12.8, FreeBSD: 0.12.8

Modules

Downloads