tagsoup: Parsing and extracting information from (possibly malformed) HTML/XML documents

[ bsd3, library, xml ] [ Propose Tags ]

TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.

Versions [faq] 0.1, 0.4, 0.6, 0.8, 0.9, 0.10, 0.10.1, 0.11, 0.11.1, 0.12, 0.12.1, 0.12.2, 0.12.3, 0.12.4, 0.12.5, 0.12.6, 0.12.7, 0.12.8, 0.13, 0.13.1, 0.13.2, 0.13.3, 0.13.4, 0.13.5, 0.13.6, 0.13.7, 0.13.8, 0.13.9, 0.13.10, 0.14, 0.14.1, 0.14.2, 0.14.3, 0.14.4, 0.14.5, 0.14.6, 0.14.7, 0.14.8
Dependencies base (>=4 && <4.8), bytestring, containers, deepseq (==1.1.*), directory, network, process, QuickCheck (==2.4.*), text, time [details]
License BSD-3-Clause
Copyright Neil Mitchell 2006-2011
Author Neil Mitchell <ndmitchell@gmail.com>
Maintainer Neil Mitchell <ndmitchell@gmail.com>
Revised Revision 1 made by AdamBergmark at 2015-04-02T15:29:37Z
Category XML
Home page http://community.haskell.org/~ndm/tagsoup/
Source repo head: darcs get http://community.haskell.org/~ndm/darcs/tagsoup/
Uploaded by NeilMitchell at 2011-12-11T15:21:52Z
Distributions Arch:0.14.8, Debian:0.14.6, Fedora:0.14.8, FreeBSD:0.13.3, LTSHaskell:0.14.8, NixOS:0.14.8, Stackage:0.14.8, openSUSE:0.14.8
Executables tagsoup
Downloads 173657 total (741 in the last 30 days)
Rating 2.5 (votes: 3) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Hackage Matrix CI
Docs uploaded by user
Build status unknown [no reports yet]

Modules

[Index]

Flags

NameDescriptionDefaultType
testprog

Build the test program

DisabledAutomatic
download

Build with Download module

DisabledAutomatic

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info

Downloads

Note: This package has metadata revisions in the cabal description newer than included in the tarball. To unpack the package including the revisions, use 'cabal get'.

Maintainer's Corner

For package maintainers and hackage trustees