data-named: Data types for named entities
|Versions||0.1.0, 0.2.0, 0.3.0, 0.4.0, 0.5.0, 0.5.1, 0.5.2, 0.6.1|
|Dependencies||attoparsec, base (==4.*), containers, text [details]|
|Copyright||Copyright (c) 2012 IPI PAN|
|Category||Natural Language Processing|
|Source repo||head: git clone git://github.com/kawu/data-named.git|
|Uploaded||by JakubWaszczuk at Wed Oct 3 15:23:35 UTC 2012|
|Downloads||2889 total (24 in the last 30 days)|
|Rating||(no votes yet) [estimated by rule of succession]|
|Status||Docs uploaded by user
Build status unknown [no reports yet]
Hackage Matrix CI
The library provides data types which can be used to represent forest structures with labels stored in internal nodes and words kept in leaves. In particular, those types are well suited for representing the layer of named entities (NEs).
The IOB method is implemented in the Data.Named.IOB module and can be used to translate between a forest of entities and a sequence of compound IOB labels. This method can be used together with a sequence classifier to indirectly model forest structures.
The Data.Named.Graph module can be used to represent more general, graph structures of entities. The module provides also a lossy conversion from a DAG to a disjoint forest of entities.
For package maintainers and hackage trustees