The tagsoup package
TagSoup is a library for extracting information out of unstructured HTML code, sometimes known as tag-soup. The HTML does not have to be well formed, or render properly within any particular framework. This library is for situations where the author of the HTML is not cooperating with the person trying to extract the information, but is also not trying to hide the information.
Properties
| Versions | 0.1, 0.4, 0.6, 0.8, 0.9, 0.10, 0.10.1, 0.11, 0.11.1, 0.12, 0.12.1, 0.12.2, 0.12.3, 0.12.4, 0.12.5, 0.12.6 |
|---|---|
| Dependencies | base, mtl, network |
| License | BSD3 |
| Copyright | 2006-8, Neil Mitchell |
| Author | Neil Mitchell |
| Maintainer | ndmitchell@gmail.com |
| Category | XML |
| Home page | http://www-users.cs.york.ac.uk/~ndm/tagsoup/ |
| Executables | tagsoup |
| Upload date | Mon Jan 14 17:57:13 UTC 2008 |
| Uploaded by | NeilMitchell |
| Built on | ghc-6.10, ghc-6.12, ghc-6.8 |
| Distributions | Fedora: 0.12.2, NixOS: 0.12.6, Debian: 0.12.6, FreeBSD: 0.12.2 |
Modules
Downloads
- tagsoup-0.4.tar.gz (Cabal source package)
- package description (included in the package)