The tagsoup-ht package

[ Tags: deprecated, gpl, library, program, xml ]
Deprecated in favor of tagchup.

TagSoup is a package for parsing and extracting information from (possibly malformed) HTML/XHTML documents. Here I present my own parser, which I find (of course) more comprehensible and easier to extend. It also handles XML declarations and CDATA sections correctly.


Versions 0.2, 0.3
Dependencies base, data-accessor (==0.1.*), mtl, tagsoup (==0.6.*)
License GPL
Author Henning Thielemann
Maintainer Henning Thielemann
Category XML
Home page
Uploaded Sun Nov 30 20:28:29 UTC 2008 by HenningThielemann
Distributions NixOS:0.3
Executables validate-tagsoup, tagsouptest
Downloads 597 total (56 in the last 30 days)
Rating (no votes yet) [estimated by rule of succession]
Status Docs uploaded by user
Build status unknown [no reports yet]



