The fast-tagsoup-utf8-only package

[Tags: bsd3, library]

Fast TagSoup parser. Speeds of 20-200MB/sec were observed.

Works only with strict bytestrings.

This library is intended to be used in conjunction with the original tagsoup package:

 import Text.HTML.TagSoup hiding (parseTags, renderTags)
 import Text.HTML.TagSoup.Fast.Utf8Only

Besides speed, fast-tagsoup correctly handles HTML <script> and <style> tags and converts tags to lower case. This fork purposely removes support for parsing non-UTF-8 documents to avoid a dependency on text-icu. If you need to handle other encodings, refer to the original fast-tagsoup package.
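
The following is a minimal usage sketch, not taken from the package documentation. It assumes that this module's parseTags takes a strict ByteString and returns a list of tagsoup Tag values, as the import line above suggests. The sample HTML and the link extraction are illustrative only; isTagOpenName and fromAttrib come from the original tagsoup package.

```haskell
{-# LANGUAGE OverloadedStrings #-}
module Main (main) where

import qualified Data.ByteString.Char8 as B
import Text.HTML.TagSoup hiding (parseTags, renderTags)
import Text.HTML.TagSoup.Fast.Utf8Only

main :: IO ()
main = do
  -- The input must be a strict ByteString containing UTF-8 text.
  let html = "<HTML><BODY><A href=\"http://example.com/\">link</A></BODY></HTML>" :: B.ByteString
      -- Assumption: parseTags here has the same shape as tagsoup's parseTags,
      -- specialised to strict ByteString input.
      tags = parseTags html
      -- Tag names are matched in lower case because the parser lower-cases them.
      links = [fromAttrib "href" t | t <- tags, isTagOpenName "a" t]
  mapM_ print links
```

Because the string literals are overloaded, the sketch does not depend on whether the resulting tags carry ByteString or Text values.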

This parser is used in production in the BazQux Reader feed and comment crawler.


Versions: 1.0.4, 1.0.5
Change log: None available
Dependencies: base (==4.*), bytestring, tagsoup, text
Copyright: Vladimir Shabanov 2011-2012
Author: Vladimir Shabanov
Maintainer: Vladimir Shabanov
Home page:
Source repository: head: git clone
Uploaded: Wed Dec 11 20:23:49 UTC 2013 by MikhailKuddah
Downloads: 448 total (10 in last 30 days)
Status: Docs available. Successful builds reported (1 report).



