The fast-tagsoup package

[Tags: bsd3, library]

Fast TagSoup parser. Speeds of 20-200MB/sec were observed.

Works only with strict bytestrings.

This library is intended to be used in conjunction with the original tagsoup package:

 import Text.HTML.TagSoup hiding (parseTags, renderTags)
 import Text.HTML.TagSoup.Fast

Besides speed, fast-tagsoup correctly handles HTML <script> and <style> tags, converts tags to lower case, and can decode non-UTF-8 XML for you.
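
As a rough illustration of combining the two packages, the sketch below collects link targets from a strict ByteString. It assumes that Text.HTML.TagSoup.Fast exports parseTags with the type ByteString -> [Tag ByteString]; check the module documentation for the exact signature. The helper name linkTargets and the sample HTML are made up for this example.

```haskell
{-# LANGUAGE OverloadedStrings #-}
module Main (main) where

import qualified Data.ByteString.Char8 as B
-- Tag type and helpers come from the original tagsoup package.
import Text.HTML.TagSoup hiding (parseTags, renderTags)
-- Assumption: the fast parser has type B.ByteString -> [Tag B.ByteString].
import Text.HTML.TagSoup.Fast (parseTags)

-- | Hypothetical helper: collect the href attribute of every <a> tag.
-- Matching on the lower-case name "a" relies on the parser converting
-- tags to lower case, as described above.
linkTargets :: B.ByteString -> [B.ByteString]
linkTargets html =
    [ fromAttrib "href" tag
    | tag <- parseTags html
    , isTagOpenName "a" tag
    ]

main :: IO ()
main = do
    -- Upper-case tag names on purpose, to exercise the lower-casing.
    let html = "<HTML><BODY><A href=\"https://example.com/\">link</A></BODY></HTML>"
    mapM_ B.putStrLn (linkTargets html)
```

Only the parser is swapped out here; fromAttrib and isTagOpenName are the ordinary tagsoup helpers, which is why the original package is still imported alongside the fast one.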

This parser is used in production in the BazQux Reader feed and comment crawler.

Properties

Versions: 1.0.0, 1.0.1, 1.0.2, 1.0.3, 1.0.4, 1.0.5, 1.0.6, 1.0.7
Change log: None available
Dependencies: base (==4.*), bytestring, tagsoup, text, text-icu
License: BSD3
Copyright: Vladimir Shabanov 2011-2012
Author: Vladimir Shabanov <vshabanoff@gmail.com>
Maintainer: Vladimir Shabanov <vshabanoff@gmail.com>
Category: XML
Home page: https://github.com/vshabanov/fast-tagsoup
Source repository: head: git clone https://github.com/vshabanov/fast-tagsoup
Uploaded: Wed Aug 22 21:38:16 UTC 2012 by VladimirShabanov
Distributions: NixOS:1.0.7
Downloads: 1193 total (79 in last 30 days)
Votes: 0
Status: Docs uploaded by user
Build status: unknown (no reports yet)
