tagsoup: Parsing and extracting information from (possibly malformed) HTML/XML documents

[ bsd3, library, xml ] [ Propose Tags ]

TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.

Versions 0.1, 0.4, 0.6, 0.8, 0.9, 0.10, 0.10.1, 0.11, 0.11.1, 0.12, 0.12.1, 0.12.2, 0.12.3, 0.12.4, 0.12.5, 0.12.6, 0.12.7, 0.12.8, 0.13, 0.13.1, 0.13.2, 0.13.3, 0.13.4, 0.13.5, 0.13.6, 0.13.7, 0.13.8, 0.13.9, 0.13.10, 0.14, 0.14.1, 0.14.2, 0.14.3, 0.14.4, 0.14.5, 0.14.6
Dependencies base (>=4 && <4.4), bytestring, containers, deepseq (==1.1.*), directory, network, process, QuickCheck (==2.4.*), time [details]
License BSD-3-Clause
Copyright Neil Mitchell 2006-2011
Author Neil Mitchell <ndmitchell@gmail.com>
Maintainer Neil Mitchell <ndmitchell@gmail.com>
Revised Revision 2 made by AdamBergmark at Thu May 21 15:30:40 UTC 2015
Category XML
Home page http://community.haskell.org/~ndm/tagsoup/
Uploaded by NeilMitchell at Wed Jun 1 20:01:40 UTC 2011
Distributions Arch:0.14.6, Debian:0.13.6, Fedora:0.14.2, FreeBSD:0.13.3, LTSHaskell:0.14.6, NixOS:0.14.6, Stackage:0.14.6, openSUSE:0.14.6
Executables tagsoup
Downloads 146357 total (663 in the last 30 days)
Rating 2.5 (votes: 3) [estimated by rule of succession]
Your Rating
  • λ
  • λ
  • λ
Status Docs uploaded by user
Build status unknown [no reports yet]
Hackage Matrix CI

Modules

[Index]

Flags

NameDescriptionDefaultType
testprog

Build the test program

DisabledAutomatic
download

Build with Download module

DisabledAutomatic

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info

Downloads

Note: This package has metadata revisions in the cabal description newer than included in the tarball. To unpack the package including the revisions, use 'cabal get'.

Maintainer's Corner

For package maintainers and hackage trustees