html-tokenizer: An "attoparsec"-based HTML tokenizer

[ html, library, mit, parsing, xml ] [ Propose Tags ]

This library can be used as a basis for complex HTML parsers, or for streaming. E.g., by composing it with the "list-t-attoparsec" library you can produce a token stream, thus becoming able to implement a highly efficient stream-parser, which works in a single pass, constant memory and is capable of early termination. "list-t-html-parser" is such a parser.

Versions [faq] 0.2.1.1, 0.2.1.2, 0.3.0.0, 0.3.0.1, 0.3.0.2, 0.3.0.3, 0.4.0.0, 0.4.1, 0.5, 0.5.1, 0.5.2, 0.6, 0.6.1, 0.6.2, 0.6.3, 0.6.4
Dependencies attoparsec (>=0.10 && <0.14), base-prelude (>=0.1.19 && <0.2), case-insensitive (==1.2.*), conversion (>=1.0.1 && <2), conversion-case-insensitive (==1.*), conversion-text (>=1.0.0.1 && <2), text (==1.*) [details]
License MIT
Copyright (c) 2015, Nikita Volkov
Author Nikita Volkov <nikita.y.volkov@mail.ru>
Maintainer Nikita Volkov <nikita.y.volkov@mail.ru>
Category Parsing, HTML, XML
Home page https://github.com/nikita-volkov/html-tokenizer
Bug tracker https://github.com/nikita-volkov/html-tokenizer/issues
Source repo head: git clone git://github.com/nikita-volkov/html-tokenizer.git
Uploaded by NikitaVolkov at Wed Jul 22 11:05:27 UTC 2015
Distributions NixOS:0.6.4
Downloads 5313 total (220 in the last 30 days)
Rating (no votes yet) [estimated by rule of succession]
Your Rating
  • λ
  • λ
  • λ
Status Hackage Matrix CI
Docs available [build log]
Last success reported on 2015-07-22 [all 1 reports]

Modules

[Index]

Downloads

Maintainer's Corner

For package maintainers and hackage trustees