The fast-tagsoup package

[Tags: bsd3, library]

Fast TagSoup parser. Speeds of 20-200MB/sec were observed.

Works only with strict bytestrings.

This library is intended to be used in conjunction with the original tagsoup package:

 import Text.HTML.TagSoup hiding (parseTags, renderTags)
 import Text.HTML.TagSoup.Fast

Besides speed fast-tagsoup correctly handles HTML <script> and <style> tags, converts tags to lower case and can decode non UTF-8 XML for you.

This parser is used in production in BazQux Reader feeds and comments crawler.

Properties

Versions1.0.0, 1.0.1, 1.0.2, 1.0.3, 1.0.4, 1.0.5, 1.0.6, 1.0.7, 1.0.8, 1.0.9
Change logNone available
Dependenciesbase (==4.*), bytestring, tagsoup, text, text-icu [details]
LicenseBSD3
CopyrightVladimir Shabanov 2011-2012
AuthorVladimir Shabanov <vshabanoff@gmail.com>
MaintainerVladimir Shabanov <vshabanoff@gmail.com>
CategoryXML
Home pagehttps://github.com/vshabanov/fast-tagsoup
Source repositoryhead: git clone https://github.com/vshabanov/fast-tagsoup
UploadedMon Sep 24 22:38:35 UTC 2012 by VladimirShabanov
DistributionsNixOS:1.0.7
Downloads1586 total (19 in last 30 days)
Votes
0 []
StatusDocs uploaded by user
Build status unknown [no reports yet]

Modules

[Index]

Downloads

Maintainers' corner

For package maintainers and hackage trustees