chatter-0.5.2.0: A library of simple NLP algorithms.

Safe Haskell: None
Language: Haskell2010

NLP.POS.LiteralTagger

Documentation

mkTagger :: Tag t => Map Text t -> CaseSensitive -> Maybe (POSTagger t) -> POSTagger t

Create a literal tagger from a map of terms to tags. If a back-off tagger is specified, it is used as a fall-back.

This uses a tokenizer adapted from the tokenize package, and Eric Kow's fullstop sentence segmenter as a sentence splitter.
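The idea of literal tagging with a back-off can be sketched without the library. The block below is only an illustration, not chatter's implementation: `Casing`, `SimpleTagger`, `mkLiteral`, `ingTagger` and `demo` are hypothetical names, tokens are plain `String`s rather than `Text`, and the back-off is assumed to return exactly one result per input token.

```haskell
import           Data.Char       (toLower)
import           Data.List       (isSuffixOf)
import qualified Data.Map.Strict as Map

-- Hypothetical stand-in for the library's CaseSensitive type.
data Casing = MatchCase | IgnoreCase

type Token = String

-- A simplified tagger: Nothing marks a token the tagger could not tag.
type SimpleTagger t = [Token] -> [(Token, Maybe t)]

-- Look every token up in the table; consult the back-off (if any)
-- only for tokens the table does not cover.
mkLiteral :: Map.Map Token t -> Casing -> Maybe (SimpleTagger t) -> SimpleTagger t
mkLiteral table casing backoff tokens = zipWith merge firstPass secondPass
  where
    normalize = case casing of
      MatchCase  -> id
      IgnoreCase -> map toLower
    -- Normalize the keys once so lookups agree with the chosen casing.
    table' = Map.fromList [ (normalize k, v) | (k, v) <- Map.toList table ]
    firstPass = [ (tok, Map.lookup (normalize tok) table') | tok <- tokens ]
    -- Assumes the back-off yields one result per token, in order.
    secondPass = case backoff of
      Nothing -> map (const Nothing) tokens
      Just b  -> map snd (b tokens)
    merge (tok, Just t)  _        = (tok, Just t)
    merge (tok, Nothing) fallback = (tok, fallback)

-- Toy back-off: tags words ending in "ing" as "VBG", nothing else.
ingTagger :: SimpleTagger String
ingTagger toks =
  [ (t, if "ing" `isSuffixOf` t then Just "VBG" else Nothing) | t <- toks ]

demo :: [(Token, Maybe String)]
demo = mkLiteral (Map.fromList [("the", "DT"), ("dog", "NN")])
                 IgnoreCase
                 (Just ingTagger)
                 ["The", "dog", "is", "barking"]
```

Here `demo` evaluates to `[("The",Just "DT"),("dog",Just "NN"),("is",Nothing),("barking",Just "VBG")]`: the table covers the first two tokens, the toy back-off covers the last, and "is" stays untagged. With `MatchCase` instead, "The" would miss the table and fall through to the back-off.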

readTagger :: Tag t => ByteString -> Maybe (POSTagger t) -> Either String (POSTagger t)

Deserialization for literal taggers. The corresponding serialization logic is in the posSerialize field of the POSTagger created by mkTagger.

data CaseSensitive

Boolean type to indicate case sensitivity for textual comparisons.

Constructors

Sensitive 
Insensitive 

protectTerms :: [Text] -> CaseSensitive -> Tokenizer

Create a tokenizer that protects the provided terms, so that multi-word terms are kept together as single tokens.
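One way to picture term protection is as a post-pass over whitespace tokens that merges any run spelling out a protected term. The sketch below makes that assumption and is not the library's tokenizer: `protect` and `tokenizeProtecting` are hypothetical names, tokens are plain `String`s, matching is always case-sensitive, and punctuation handling is ignored.

```haskell
import Data.List (isPrefixOf, sortOn)
import Data.Ord  (Down (..))

-- Merge runs of tokens that spell out a protected term into one token.
-- Each protected term is given as its list of words.
protect :: [[String]] -> [String] -> [String]
protect terms = go
  where
    -- Try longer terms first so "New York City" wins over "New York".
    -- Empty terms are dropped; they would match without consuming input.
    ordered = sortOn (Down . length) (filter (not . null) terms)
    go [] = []
    go toks@(t : rest) =
      case filter (`isPrefixOf` toks) ordered of
        (term : _) -> unwords term : go (drop (length term) toks)
        []         -> t : go rest

-- Split on whitespace, then re-join the protected multi-word terms.
tokenizeProtecting :: [String] -> String -> [String]
tokenizeProtecting terms = protect (map words terms) . words
```

For example, `tokenizeProtecting ["New York", "hot dog"] "I ate a hot dog in New York"` yields `["I","ate","a","hot dog","in","New York"]`, so a literal tagger keyed on "hot dog" can then match it as one token.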