chatter-0.9.1.0: A library of simple NLP algorithms.

Safe HaskellNone
LanguageHaskell2010

NLP.POS.LiteralTagger

Synopsis

Documentation

mkTagger :: Tag t => Map Text t -> CaseSensitive -> Maybe (POSTagger t) -> POSTagger t Source #

Create a Literal Tagger using the specified back-off tagger as a fall-back, if one is specified.

This uses a tokenizer adapted from the tokenize package for a tokenizer, and Erik Kow's fullstop sentence segmenter as a sentence splitter.

readTagger :: Tag t => ByteString -> Maybe (POSTagger t) -> Either String (POSTagger t) Source #

deserialization for Literal Taggers. The serialization logic is in the posSerialize record of the POSTagger created in mkTagger.

data CaseSensitive Source #

Boolean type to indicate case sensitivity for textual comparisons.

Constructors

Sensitive 
Insensitive 

protectTerms :: [Text] -> CaseSensitive -> Tokenizer Source #

Create a tokenizer that protects the provided terms (to tokenize multi-word terms)