The tagsoup-ht package

[Tags: deprecated, gpl, library, program]

Deprecated in favor of tagchup

TagSoup is a package for parsing and extracting information from (possibly malformed) HTML/XHTML documents. Here I present my own parser, which I find (of course) more comprehensible and easier to extend. It also handles XML declarations and CDATA sections correctly. This package is abandoned and will be renamed to Tagchup.


Versions0.2, 0.3
Change logNone available
Dependenciesbase (==3.*), bytestring (>= && <0.10), containers (>=0.1 && <0.3), data-accessor (==0.2.*), explicit-exception (==0.1.*), old-time (==1.0.*), tagsoup (==0.6.*), transformers (>=0.0 && <0.2), utility-ht (>=0.0.1 && <0.1), xml-basic (>=0.0.1 && <0.1) [details]
AuthorHenning Thielemann <>
MaintainerHenning Thielemann <>
Home page
Executablesvalidate-tagsoup, tagsoupspeed, tagsouptest
UploadedWed Mar 4 21:57:08 UTC 2009 by HenningThielemann
Downloads388 total (12 in last 30 days)
0 []
StatusDocs uploaded by user
Build status unknown [no reports yet]




Maintainers' corner

For package maintainers and hackage trustees