fast-tagsoup-utf8-only: Fast parser for tagsoup package
Fast TagSoup parser. Speeds of 20-200MB/sec were observed.
Works only with strict bytestrings.
This library is intended to be used in conjunction with the original tagsoup package:
    import Text.HTML.TagSoup hiding (parseTags, renderTags)
    import Text.HTML.TagSoup.Fast.Utf8Only
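A minimal sketch of how the two modules might be combined. It assumes that Text.HTML.TagSoup.Fast.Utf8Only exports a parseTags function of type ByteString -> [Tag ByteString], which the hiding clause and the strict-bytestring restriction suggest but this page does not state. The linkTargets helper and the sample input are hypothetical and exist only for illustration.

```haskell
module Main (main) where

import qualified Data.ByteString.Char8 as B
-- Tag type and constructors come from the original tagsoup package.
import Text.HTML.TagSoup hiding (parseTags, renderTags)
-- Assumption: this module provides parseTags :: B.ByteString -> [Tag B.ByteString].
import Text.HTML.TagSoup.Fast.Utf8Only

-- | Hypothetical helper: collect the href attribute of every <a> tag
-- found in a strict, UTF-8 encoded ByteString.
linkTargets :: B.ByteString -> [B.ByteString]
linkTargets html =
  [ href
  | TagOpen name attrs <- parseTags html
  , name == B.pack "a"
  , Just href <- [lookup (B.pack "href") attrs]
  ]

main :: IO ()
main =
  mapM_ B.putStrLn
    (linkTargets (B.pack "<p><a href=\"http://example.com/\">link</a></p>"))
```

Because only parseTags and renderTags are hidden, the Tag type and the other tagsoup helpers still come from the original package, so existing tagsoup code should mostly need only the changed imports.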
fast-tagsoup correctly handles HTML <style> tags and converts tags to lower case.
This fork purposefully removes support for parsing non-utf8 documents, to avoid dependency on text-icu.
If you need to handle other encodings, refer to the original fast-tagsoup package: http://hackage.haskell.org/package/fast-tagsoup
This parser is used in production in the BazQux Reader feed and comment crawler.
|Versions||1.0.4, 1.0.5|
|Dependencies||base (==4.*), bytestring, tagsoup, text|
|Copyright||Vladimir Shabanov 2011-2012|
|Author||Vladimir Shabanov <firstname.lastname@example.org>|
|Maintainer||Vladimir Shabanov <email@example.com>|
|Source repo||head: git clone https://github.com/exbb2/fast-tagsoup|
|Uploaded||by MikhailKuddah at 2013-12-11T20:23:49Z|
|Downloads||1858 total (20 in the last 30 days)|
|Rating||(no votes yet)|