colada-0.5.6: Colada implements incremental word class class induction using online LDA

Safe HaskellNone




type Token = Vector TextSource

Token is a representation of a word, which consists of a number of fields.

type Field = TextSource

Field is a part of a word token, such as word form, lemma or POS tag

type Sentence = Vector TokenSource

Sentence is a vector of tokens.

parse :: Text -> [Sentence]Source

parse text returns a lazy list of sentences.