colada-0.4.3: Colada implements incremental word class class induction using online LDA

Safe HaskellSafe-Infered

NLP.CoNLL

Synopsis

Documentation

type Token = Vector TextSource

Token is a representation of a word, which consists of a number of fields.

type Field = TextSource

Field is a part of a word token, such as word form, lemma or POS tag

type Sentence = Vector TokenSource

Sentence is a vector of tokens.

parse :: Text -> [Sentence]Source

parse text returns a lazy list of sentences.