Safe Haskell | None |
---|---|
Language | Haskell2010 |
- tag :: Map Text Tag -> CaseSensitive -> [Sentence] -> [TaggedSentence]
- tagSentence :: Map Text Tag -> CaseSensitive -> Sentence -> TaggedSentence
- mkTagger :: Map Text Tag -> CaseSensitive -> Maybe POSTagger -> POSTagger
- taggerID :: ByteString
- readTagger :: ByteString -> Maybe POSTagger -> Either String POSTagger
- data CaseSensitive
- protectTerms :: [Text] -> CaseSensitive -> Tokenizer
Documentation
tag :: Map Text Tag -> CaseSensitive -> [Sentence] -> [TaggedSentence]
tagSentence :: Map Text Tag -> CaseSensitive -> Sentence -> TaggedSentence
mkTagger :: Map Text Tag -> CaseSensitive -> Maybe POSTagger -> POSTagger
Create a Literal Tagger using the specified back-off tagger as a fall-back, if one is specified.
The resulting tagger uses a tokenizer adapted from the tokenize package, and Eric Kow's fullstop sentence segmenter as its sentence splitter.
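The lookup-then-fall-back behaviour described above can be sketched as follows. This is a minimal illustration and not the library's implementation: `Tag`, `CaseSensitive`, `BackOff`, `unknownTag` and `literalTag` are stand-in definitions written for this example, the back-off is modelled as a plain per-token function rather than a `POSTagger`, and the choice of `Tag "Unk"` for unmatched tokens is an assumption.

```haskell
{-# LANGUAGE OverloadedStrings #-}

import           Data.Map.Strict (Map)
import qualified Data.Map.Strict as Map
import           Data.Maybe      (fromMaybe)
import           Data.Text       (Text)
import qualified Data.Text       as T

-- Stand-in types for this sketch (assumptions, not the library's definitions).
newtype Tag = Tag Text deriving (Eq, Show)

data CaseSensitive = Sensitive | Insensitive deriving (Eq, Show)

-- | Hypothetical back-off: any function that can tag a single token.
type BackOff = Text -> Tag

-- | Tag used when neither the table nor the back-off knows the token
-- (the name "Unk" is an assumption).
unknownTag :: Tag
unknownTag = Tag "Unk"

-- | Look each token up in the table; use the back-off when the lookup fails.
literalTag :: Map Text Tag -> CaseSensitive -> Maybe BackOff -> [Text] -> [(Text, Tag)]
literalTag table sensitivity backOff = map tagOne
  where
    normalize = case sensitivity of
      Sensitive   -> id
      Insensitive -> T.toLower
    -- Normalize the keys once so that lookups compare like with like.
    table' = Map.mapKeys normalize table
    fallback tok = maybe unknownTag ($ tok) backOff
    tagOne tok = (tok, fromMaybe (fallback tok) (Map.lookup (normalize tok) table'))

main :: IO ()
main = do
  let table = Map.fromList [("the", Tag "DT"), ("Paris", Tag "NNP")]
  -- Case-insensitive lookup, no back-off.
  print (literalTag table Insensitive Nothing ["The", "Paris", "runs"])
  -- Case-sensitive lookup with a back-off that tags everything as a noun.
  print (literalTag table Sensitive (Just (const (Tag "NN"))) ["The", "Paris", "runs"])
```

In the first call "The" and "Paris" match after lower-casing and "runs" receives the unknown tag; in the second call "The" no longer matches the key "the", so it goes to the back-off along with "runs".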
readTagger :: ByteString -> Maybe POSTagger -> Either String POSTagger
Deserialization for Literal Taggers. The serialization logic is in the posSerialize record field of the POSTagger created by mkTagger.
data CaseSensitive
Boolean type to indicate case sensitivity for textual comparisons.
Instances
- Read CaseSensitive
- Show CaseSensitive
- Generic CaseSensitive
- Serialize CaseSensitive
- type Rep CaseSensitive
protectTerms :: [Text] -> CaseSensitive -> Tokenizer
Create a tokenizer that protects the provided terms, so that multi-word terms are kept together as single tokens.
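To illustrate what protecting a multi-word term means, here is a small self-contained sketch. It is not the library's `Tokenizer`: `protectSketch` is a hypothetical plain function from `Text` to tokens, `CaseSensitive` is redefined locally, and splitting on whitespace with a longest-match rule is an assumption made for the example.

```haskell
{-# LANGUAGE OverloadedStrings #-}

import           Data.List (isPrefixOf, sortOn)
import           Data.Ord  (Down (..))
import           Data.Text (Text)
import qualified Data.Text as T

-- Stand-in for the library's type (assumption for this sketch).
data CaseSensitive = Sensitive | Insensitive deriving (Eq, Show)

-- | Split on whitespace, but keep each protected term together as one token.
protectSketch :: [Text] -> CaseSensitive -> Text -> [Text]
protectSketch terms sensitivity = go . T.words
  where
    normalize = case sensitivity of
      Sensitive   -> id
      Insensitive -> T.toLower
    -- Each term as a list of normalized words, longest first, so that the
    -- longest match wins when two terms share a prefix.
    patterns = sortOn (Down . length)
      [ ws | t <- terms, let ws = map normalize (T.words t), not (null ws) ]
    go [] = []
    go ws@(w:rest) =
      case [ length p | p <- patterns, p `isPrefixOf` map normalize ws ] of
        (n:_) -> T.unwords (take n ws) : go (drop n ws)  -- merge the matched words
        []    -> w : go rest                             -- ordinary token

main :: IO ()
main = do
  print (protectSketch ["New York"] Insensitive "I love new york city")
  print (protectSketch ["New York"] Sensitive   "I love new york city")
```

With `Insensitive` the words "new york" come back as one token; with `Sensitive` the term "New York" does not match the lower-case input, so the two words stay separate.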