html-tokenizer: An "attoparsec"-based HTML tokenizer

[ html, library, mit, parsing, xml ] [ Propose Tags ]

This library can be used as a basis for complex HTML parsers, or for streaming. E.g., by composing it with the "list-t-attoparsec" library you can produce a token stream, thus becoming able to implement a highly efficient stream-parser, which works in a single pass, constant memory and is capable of early termination. "list-t-html-parser" is such a parser.

Modules

[Index]

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

Versions [RSS] 0.2.1.1, 0.2.1.2, 0.3.0.0, 0.3.0.1, 0.3.0.2, 0.3.0.3, 0.4.0.0, 0.4.1, 0.5, 0.5.1, 0.5.2, 0.6, 0.6.1, 0.6.2, 0.6.3, 0.6.4
Dependencies attoparsec (>=0.10 && <0.13), base-prelude (>=0.1.19 && <0.2), case-insensitive (>=1.2 && <1.3), conversion (>=1.0.1 && <2), conversion-case-insensitive (>=1 && <2), conversion-text (>=1.0.0.1 && <2), text (>=1 && <1.3) [details]
License MIT
Copyright (c) 2015, Nikita Volkov
Author Nikita Volkov <nikita.y.volkov@mail.ru>
Maintainer Nikita Volkov <nikita.y.volkov@mail.ru>
Category Parsing, HTML, XML
Home page https://github.com/nikita-volkov/html-tokenizer
Bug tracker https://github.com/nikita-volkov/html-tokenizer/issues
Source repo head: git clone git://github.com/nikita-volkov/html-tokenizer.git
Uploaded by NikitaVolkov at 2015-05-01T14:15:37Z
Distributions
Reverse Dependencies 1 direct, 1 indirect [details]
Downloads 9819 total (38 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs available [build log]
Last success reported on 2015-05-01 [all 1 reports]