data-named: Data types for named entities

[ bsd3, library, natural-language-processing ]

Please see the README on GitHub at https://github.com/kawu/data-named#readme


Versions: 0.1.0, 0.2.0, 0.3.0, 0.4.0, 0.5.0, 0.5.1, 0.5.2, 0.6.1, 0.6.2
Dependencies: attoparsec (>=0.12 && <0.14), base (>=4.8 && <5), binary (>=0.7 && <0.9), containers (>=0.5 && <0.7), text (>=1.1 && <1.3)
License: BSD-3-Clause
Copyright: 2012-2019 IPI PAN, Jakub Waszczuk
Author: Jakub Waszczuk
Maintainer: waszczuk.kuba@gmail.com
Category: Natural Language Processing
Home page: https://github.com/kawu/data-named#readme
Bug tracker: https://github.com/kawu/data-named/issues
Source repo: head: git clone https://github.com/kawu/data-named
Uploaded: by JakubWaszczuk at 2019-11-30T20:54:55Z
Reverse Dependencies: 3 direct, 0 indirect
Downloads: 6973 total (29 in the last 30 days)
Status: Docs available
Last success reported on 2019-11-30

Readme for data-named-0.6.2

Data-named

The library provides data types which can be used to represent forest structures with labels stored in internal nodes and words kept in leaves. In particular, those types are well suited for representing the layer of named entities (NEs).
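
As a rough illustration of such a forest, the sketch below uses `Data.Tree` from the containers package, with `Left` for an entity label in an internal node and `Right` for a word in a leaf. The names `NeTree`, `NeForest`, `example` and `wordsOf` are assumptions made for this sketch and are not taken from the package documentation; the types actually exported by the library may differ.

```haskell
import Data.Tree (Tree (..), Forest, flatten)

-- Assumed aliases for this sketch: Left = entity label (internal node),
-- Right = word (leaf).
type NeTree a b = Tree (Either a b)
type NeForest a b = Forest (Either a b)

-- The sentence "Jan Kowalski lives in Warsaw" with two entities.
example :: NeForest String String
example =
  [ Node (Left "PER") [leaf "Jan", leaf "Kowalski"]
  , leaf "lives"
  , leaf "in"
  , Node (Left "LOC") [leaf "Warsaw"]
  ]
  where
    leaf w = Node (Right w) []

-- Collect the words of a forest in left-to-right order,
-- skipping the entity labels.
wordsOf :: NeForest a b -> [b]
wordsOf forest = [w | tree <- forest, Right w <- flatten tree]
```

Here `wordsOf example` evaluates to `["Jan","Kowalski","lives","in","Warsaw"]`: the entity layer sits on top of the sentence without changing the word order.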

The IOB (inside, outside, beginning) method is implemented in the Data.Named.IOB module and can be used to translate between a forest of entities and a sequence of compound IOB labels. Combined with a sequence classifier, this method makes it possible to model forest structures indirectly.
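
To make the idea of compound IOB labels concrete, here is a minimal sketch of the encoding direction only. It does not use the Data.Named.IOB API; the names `Atom`, `Label`, `encodeForest` and `encodeTree` are hypothetical and chosen for this illustration. Each word receives one atom per enclosing entity, ordered from the outermost entity inwards: `B` if the word is the first word of that entity, `I` otherwise. An empty label means the word lies outside every entity.

```haskell
import Data.Tree (Tree (..), Forest)

-- Assumed forest representation: Left = entity label, Right = word.
type NeTree a b = Tree (Either a b)
type NeForest a b = Forest (Either a b)

-- One IOB atom per enclosing entity (hypothetical type for this sketch).
data Atom a = B a | I a
  deriving (Show, Eq)

-- Compound label: atoms from the outermost entity to the innermost one.
type Label a = [Atom a]

-- Encode a whole forest as a sequence of (word, compound label) pairs.
encodeForest :: NeForest a b -> [(b, Label a)]
encodeForest = concatMap encodeTree

-- Encode one tree: a leaf yields its word with an empty label; an entity
-- node prepends B to its first word and I to the remaining words.
encodeTree :: NeTree a b -> [(b, Label a)]
encodeTree (Node (Right w) _) = [(w, [])]
encodeTree (Node (Left e) kids) =
  case concatMap encodeTree kids of
    [] -> []  -- an entity without words leaves no trace in the labels
    (w, l) : rest -> (w, B e : l) : [(v, I e : m) | (v, m) <- rest]

-- Helper for building leaves.
leaf :: b -> NeTree a b
leaf w = Node (Right w) []

-- "University of Warsaw hired Jan", with LOC nested inside ORG.
sample :: NeForest String String
sample =
  [ Node (Left "ORG") [leaf "University", leaf "of", Node (Left "LOC") [leaf "Warsaw"]]
  , leaf "hired"
  , Node (Left "PER") [leaf "Jan"]
  ]
```

With these definitions, `encodeForest sample` yields `[("University",[B "ORG"]),("of",[I "ORG"]),("Warsaw",[I "ORG",B "LOC"]),("hired",[]),("Jan",[B "PER"])]`. A sequence classifier can then predict one compound label per word, and the forest can be rebuilt from the predicted labels.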

The Data.Named.Graph module can be used to represent more general graph structures of entities. The module also provides a lossy conversion from a DAG to a disjoint forest of entities.
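
The sketch below shows why a DAG-to-forest conversion has to be lossy. It is one possible strategy, assumed here for illustration, and not necessarily the one implemented in Data.Named.Graph: each node is kept under the first parent that reaches it in a depth-first traversal, and every other incoming edge is dropped. The names `Dag` and `toForest` are hypothetical.

```haskell
import qualified Data.Map.Strict as M
import qualified Data.Set as S
import Data.Tree (Tree (..), Forest)

-- Assumed DAG representation: each node maps to its list of children.
type Dag k = M.Map k [k]

-- Depth-first conversion starting from the given roots. A node that has
-- already been placed in the forest is skipped, so it keeps only its
-- first parent; the remaining edges into it are lost.
toForest :: Ord k => Dag k -> [k] -> Forest k
toForest dag roots = snd (goMany S.empty roots)
  where
    goMany seen [] = (seen, [])
    goMany seen (k : ks)
      | k `S.member` seen = goMany seen ks
      | otherwise =
          let (seen1, kids) = goMany (S.insert k seen) (M.findWithDefault [] k dag)
              (seen2, rest) = goMany seen1 ks
          in (seen2, Node k kids : rest)
```

For example, in a DAG where both `"a"` and `"b"` have the child `"d"`, calling `toForest` with roots `["a", "b"]` keeps `"d"` under `"a"` only, so the edge from `"b"` to `"d"` cannot be recovered from the resulting forest.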
