The html-parse package

This is a package candidate release! Here you can preview how this package release will appear once published to the main package index (which can be accomplished via the 'maintain' link below). Please note that once a package has been published to the main package index it cannot be undone! Please consult the package uploading documentation for more information.

[maintain]

This package provides a fast and reasonably robust HTML5 tokenizer built upon the attoparsec library. The parsing strategy is based upon the HTML5 parsing specification with few deviations.

The package targets similar use-cases to the venerable tagsoup library, but is significantly more efficient, achieving parsing speeds of over 50 megabytes per second on modern hardware with and typical web documents.

Properties

Versions0.1.0.0, 0.2.0.0, 0.2.0.0, 0.2.0.1
Change logNone available
Dependenciesattoparsec (==0.13.*), base (>=4.8 && <4.10), deepseq (==1.4.*), text (==1.2.*) [details]
LicenseBSD3
Copyright(c) 2016 Ben Gamari
AuthorBen Gamari
Maintainerben@smart-cactus.org
CategoryText
Home pagehttp://github.com/bgamari/html-parse
Source repositoryhead: git clone git://github.com/bgamari/html-parse
UploadedWed Nov 23 17:00:59 UTC 2016 by BenGamari

Modules

[Index]

Downloads

Maintainers' corner

For package maintainers and hackage trustees