concraft-0.1.0: Morphosyntactic tagging tool based on constrained CRFs

Safe HaskellNone

NLP.Concraft.Plain

Contents

Description

Simple format for morphosyntax representation which assumes that all tags have a textual representation with no spaces inside and that one of the tags indicates unknown words.

Synopsis

Types

data Space Source

No space, space or newline.

Constructors

None 
Space 
NewLine 

Instances

data Token Source

A token.

Constructors

Token 

Fields

orth :: Text
 
space :: Space
 
known :: Bool
 
interps :: Map Interp Bool

Interpretations with disambiguation info.

Instances

data Interp Source

Constructors

Interp 

Fields

base :: Text
 
tag :: Text
 

Instances

Interface

fromTok :: Token -> (Word Text, Choice Text)Source

Extract information relevant for tagging.

choose :: Token -> Set Text -> TokenSource

Mark all interpretations with tag component beeing a member of the given choice set with disamb annotations.

addInterps :: Bool -> Token -> [Interp] -> TokenSource

Add new interpretations with given disamb annotation.

addNones :: Bool -> Token -> [Text] -> TokenSource

Add new interpretations with None base and given disamb annotation.

Parsing

Showing