tagsoup-ht: alternative parser for the tagsoup package
TagSoup is a package for parsing and extracting information from (possibly malformed) HTML/XHTML documents. Here I present my own parser for it, which I find (of course) more comprehensible and easier to extend. It also handles XML declarations and CDATA sections correctly. Note: this package is abandoned under its current name and will be renamed to Tagchup.
| Field | Value |
|---|---|
| Versions | 0.2, 0.3 |
| Dependencies | base (>=3 && <4), bytestring (>=0.9.0.1 && <0.10), containers (>=0.1 && <0.3), data-accessor (>=0.2 && <0.3), explicit-exception (>=0.1 && <0.2), old-time (>=1.0 && <1.1), tagsoup (>=0.6 && <0.7), transformers (>=0.0 && <0.2), utility-ht (>=0.0.1 && <0.1), xml-basic (>=0.0.1 && <0.1) |
| License | GPL |
| Author | Henning Thielemann <tagsoup@henning-thielemann.de> |
| Maintainer | Henning Thielemann <tagsoup@henning-thielemann.de> |
| Category | XML |
| Home page | http://code.haskell.org/~thielema/tagsoup-ht/ |
| Executables | tagsouptest, tagsoupspeed, validate-tagsoup |
| Upload date | Wed Mar 4 21:57:08 UTC 2009 |
| Uploaded by | HenningThielemann |
| Built on | ghc-6.10, ghc-6.12 |
| Distributions | Arch: 0.3 |
Modules
- Text
Downloads
- tagsoup-ht-0.3.tar.gz (Cabal source package)
- package description (included in the package)
