Safe Haskell | None |
---|---|
Language | Haskell2010 |
- tag :: Tag t => Map Text t -> CaseSensitive -> [Sentence] -> [TaggedSentence t]
- tagSentence :: Tag t => Map Text t -> CaseSensitive -> Sentence -> TaggedSentence t
- mkTagger :: Tag t => Map Text t -> CaseSensitive -> Maybe (POSTagger t) -> POSTagger t
- taggerID :: ByteString
- readTagger :: Tag t => ByteString -> Maybe (POSTagger t) -> Either String (POSTagger t)
- data CaseSensitive
- protectTerms :: [Text] -> CaseSensitive -> Tokenizer
Documentation
tag :: Tag t => Map Text t -> CaseSensitive -> [Sentence] -> [TaggedSentence t]
tagSentence :: Tag t => Map Text t -> CaseSensitive -> Sentence -> TaggedSentence t
mkTagger :: Tag t => Map Text t -> CaseSensitive -> Maybe (POSTagger t) -> POSTagger t
Create a Literal Tagger using the specified back-off tagger as a fall-back, if one is specified.
This uses a tokenizer adapted from the tokenize package, and Eric Kow's fullstop sentence segmenter as the sentence splitter.
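To make the back-off behaviour concrete, here is a minimal, self-contained sketch of literal tagging with an optional fall-back. It is a simplified model, not the library's implementation: `CaseMode` (including its constructor names), `SimpleTagger`, `mkLiteral`, and the explicit unknown-tag argument are all hypothetical stand-ins for `CaseSensitive`, `POSTagger`, `mkTagger`, and whatever the `Tag` class supplies.

```haskell
{-# LANGUAGE OverloadedStrings #-}
import           Data.Map.Strict (Map)
import qualified Data.Map.Strict as Map
import           Data.Maybe (fromMaybe)
import           Data.Text (Text)
import qualified Data.Text as T

-- Hypothetical stand-in for the library's CaseSensitive type;
-- the constructor names are assumptions.
data CaseMode = Sensitive | Insensitive

-- Hypothetical stand-in for a tagger: tokens in, (token, tag) pairs out.
type SimpleTagger t = [Text] -> [(Text, t)]

-- Normalise a token according to the case mode before comparing.
normalise :: CaseMode -> Text -> Text
normalise Sensitive   = id
normalise Insensitive = T.toLower

-- Look each token up in the literal table. Tokens that are not in the
-- table take the back-off tagger's tag if a back-off is given, and the
-- supplied "unknown" tag otherwise.
mkLiteral :: Map Text t -> CaseMode -> t -> Maybe (SimpleTagger t) -> SimpleTagger t
mkLiteral table mode unk backoff toks = zipWith pick toks fallback
  where
    table' = Map.mapKeys (normalise mode) table
    fallback = case backoff of
      Just tagger -> map snd (tagger toks)
      Nothing     -> map (const unk) toks
    pick tok fb = (tok, fromMaybe fb (Map.lookup (normalise mode tok) table'))

-- Case-insensitive lookup with no back-off tagger.
example :: [(Text, String)]
example =
  mkLiteral (Map.fromList [("the", "DT"), ("dog", "NN")])
            Insensitive "UNK" Nothing
            ["The", "dog", "barks"]
-- example == [("The","DT"),("dog","NN"),("barks","UNK")]
```

In this model the literal table always wins over the back-off, which is consulted only for tokens the table does not cover.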
readTagger :: Tag t => ByteString -> Maybe (POSTagger t) -> Either String (POSTagger t)
Deserialization for Literal Taggers. The serialization logic is in the posSerialize field of the POSTagger created by mkTagger.
data CaseSensitive
Boolean type to indicate case sensitivity for textual comparisons.
protectTerms :: [Text] -> CaseSensitive -> Tokenizer
Create a tokenizer that protects the provided terms, so that multi-word terms are kept together as single tokens.
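The idea of protecting multi-word terms can be sketched as follows. This is an illustrative model only: `protectSketch` is a hypothetical function that works on whitespace-separated tokens and compares case-sensitively, whereas the real `protectTerms` returns a `Tokenizer` and takes a `CaseSensitive` argument.

```haskell
{-# LANGUAGE OverloadedStrings #-}
import           Data.List (isPrefixOf, sortOn)
import           Data.Ord (Down (..))
import           Data.Text (Text)
import qualified Data.Text as T

-- Split the input on whitespace, then greedily merge any run of tokens
-- that spells out one of the protected terms into a single token.
protectSketch :: [Text] -> Text -> [Text]
protectSketch terms = go . T.words
  where
    -- Each protected term as a list of words, longest first, so that
    -- the longest matching term wins. Empty terms are dropped.
    termToks = sortOn (Down . length) (filter (not . null) (map T.words terms))

    go [] = []
    go toks@(t : rest) =
      case filter (`isPrefixOf` toks) termToks of
        (m : _) -> T.unwords m : go (drop (length m) toks)
        []      -> t : go rest

-- "New York City" stays one token even though "New York" also matches.
example :: [Text]
example = protectSketch ["New York", "New York City"] "I love New York City today"
-- example == ["I","love","New York City","today"]
```

Trying the longest term first is a design choice of this sketch; it avoids splitting "New York City" into "New York" and "City" when both terms are protected.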