colada-0.5.6: Colada implements incremental word class class induction using online LDA
NLP.CoNLL
Synopsis
type Token = Vector TextSource
Token is a representation of a word, which consists of a number of fields.
Token
type Field = TextSource
Field is a part of a word token, such as word form, lemma or POS tag
Field
type Sentence = Vector TokenSource
Sentence is a vector of tokens.
Sentence
parse :: Text -> [Sentence]Source
parse text returns a lazy list of sentences.
parse text