The fast-tagsoup package

[Tags:bsd3, library]

Fast TagSoup parser. Speeds of 20-200MB/sec were observed.

Works only with strict bytestrings.

This library is intended to be used in conjunction with the original tagsoup package:

 import Text.HTML.TagSoup hiding (parseTags, renderTags)
 import Text.HTML.TagSoup.Fast

Besides speed fast-tagsoup correctly handles HTML <script> and <style> tags, converts tags to lower case and can decode non UTF-8 XML for you.

This parser is used in production in BazQux Reader feeds and comments crawler.

Properties

Versions 1.0.0, 1.0.1, 1.0.2, 1.0.3, 1.0.4, 1.0.5, 1.0.6, 1.0.7, 1.0.8, 1.0.9, 1.0.10, 1.0.11, 1.0.12
Dependencies base (==4.*), bytestring, tagsoup, text, text-icu [details]
License BSD3
Copyright Vladimir Shabanov 2011-2012
Author Vladimir Shabanov <vshabanoff@gmail.com>
Maintainer Vladimir Shabanov <vshabanoff@gmail.com>
Stability Unknown
Category XML
Home page https://github.com/vshabanov/fast-tagsoup
Source repository head: git clone https://github.com/vshabanov/fast-tagsoup
Uploaded Mon Sep 24 22:38:35 UTC 2012 by VladimirShabanov
Distributions NixOS:1.0.12
Downloads 1694 total (40 in the last 30 days)
Votes
0 []
Status Docs uploaded by user
Build status unknown [no reports yet]

Modules

[Index]

Downloads

Maintainer's Corner

For package maintainers and hackage trustees