tagsoup: Parsing and extracting information from (possibly malformed) HTML/XML documents

[ bsd3, library, xml ] [ Propose Tags ]

TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.

Versions 0.1, 0.4, 0.6, 0.8, 0.9, 0.10, 0.10.1, 0.11, 0.11.1, 0.12, 0.12.1, 0.12.2, 0.12.3, 0.12.4, 0.12.5, 0.12.6, 0.12.7, 0.12.8, 0.13, 0.13.1, 0.13.2, 0.13.3, 0.13.4, 0.13.5, 0.13.6, 0.13.7, 0.13.8, 0.13.9, 0.13.10, 0.14, 0.14.1, 0.14.2, 0.14.3, 0.14.4, 0.14.5, 0.14.6
Dependencies base (>=4 && <4.8), bytestring, containers, deepseq (>=1.1 && <1.4), directory, network, process, QuickCheck (>=2.4 && <2.6), text, time [details]
License BSD-3-Clause
Copyright Neil Mitchell 2006-2012
Author Neil Mitchell <ndmitchell@gmail.com>
Maintainer Neil Mitchell <ndmitchell@gmail.com>
Revised Revision 1 made by AdamBergmark at Thu Apr 2 15:28:45 UTC 2015
Category XML
Home page http://community.haskell.org/~ndm/tagsoup/
Source repo head: darcs get http://community.haskell.org/~ndm/darcs/tagsoup/
Uploaded by NeilMitchell at Sun Aug 19 16:29:34 UTC 2012
Distributions Arch:0.14.6, Debian:0.13.6, Fedora:0.14.2, FreeBSD:0.13.3, LTSHaskell:0.14.6, NixOS:0.14.6, Stackage:0.14.6, openSUSE:0.14.6
Executables tagsoup
Downloads 145453 total (351 in the last 30 days)
Rating 2.5 (votes: 3) [estimated by rule of succession]
Your Rating
  • λ
  • λ
  • λ
Status Docs uploaded by user
Build status unknown [no reports yet]
Hackage Matrix CI

Modules

[Index]

Flags

NameDescriptionDefaultType
testprog

Build the test program

DisabledAutomatic
download

Build with Download module

DisabledAutomatic

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info

Downloads

Note: This package has metadata revisions in the cabal description newer than included in the tarball. To unpack the package including the revisions, use 'cabal get'.

Maintainer's Corner

For package maintainers and hackage trustees