| Safe Haskell | Safe-Infered |
|---|
NLP.GenI.LexicalSelection
Contents
Description
This module performs the core of lexical selection and anchoring.
- type LexicalSelector = Macros -> Lexicon -> Sem -> IO LexicalSelection
- data LexicalSelection = LexicalSelection {
- lsAnchored :: [TagElem]
- lsLexEntries :: [LexEntry]
- lsWarnings :: GeniWarnings
- defaultLexicalSelector :: Macros -> Lexicon -> Sem -> IO LexicalSelection
- defaultLexicalSelection :: Macros -> Lexicon -> Sem -> LexicalSelection
- missingLexEntries :: [TagElem] -> [LexEntry] -> [LexEntry]
- defaultLexicalChoice :: Lexicon -> Sem -> [LexEntry]
- chooseCandI :: Sem -> [LexEntry] -> [LexEntry]
- mergeSynonyms :: [LexEntry] -> [LexEntry]
- type LexCombine a = MaybeT (Writer [LexCombineError]) a
- lexTell :: LexCombineError -> LexCombine ()
- defaultAnchoring :: Macros -> [LexEntry] -> Sem -> LexicalSelection
- combineList :: Sem -> Macros -> LexEntry -> ([LexCombineError], [TagElem])
- combineOne :: Sem -> LexEntry -> SchemaTree -> LexCombine [TagElem]
- enrich :: LexEntry -> SchemaTree -> LexCombine SchemaTree
- enrichBy :: SchemaTree -> PathEqPair -> LexCombine SchemaTree
- maybeEnrichBy :: SchemaTree -> PathEqPair -> Maybe (SchemaTree, Subst)
- enrichFeat :: AvPair GeniVal -> Flist [GeniVal] -> Maybe (Flist [GeniVal], Subst)
- missingCoanchors :: LexEntry -> SchemaTree -> [Text]
- lexEquations :: LexEntry -> Writer [LexCombineError] ([AvPair GeniVal], [PathEqPair])
- seekCoanchor :: NodePathEqLhs -> SchemaTree -> Maybe SchemaNode
- matchNodeName :: NodePathEqLhs -> SchemaNode -> Bool
- matchNodeNameHelper :: Text -> SchemaNode -> Bool
- setLemAnchors :: Tree (GNode GeniVal) -> Tree (GNode GeniVal)
- _lemanchor :: Text
- setOrigin :: Text -> Tree (GNode v) -> Tree (GNode v)
Lexical selection algorithms
type LexicalSelector = Macros -> Lexicon -> Sem -> IO LexicalSelectionSource
See Configuration if you want to use GenI with a custom
lexical selection function.
data LexicalSelection Source
The result of the lexical selection process
Constructors
| LexicalSelection | |
Fields
| |
defaultLexicalSelector :: Macros -> Lexicon -> Sem -> IO LexicalSelectionSource
Performs standard GenI lexical selection as described in http://projects.haskell.org/GenI/manual/lexical-selection.html
This is just defaultLexicalSelection lifted into IO
defaultLexicalSelection :: Macros -> Lexicon -> Sem -> LexicalSelectionSource
Helper for defaultLexicalSelector
(Standard GenI lexical selection is actually pure)
This is just defaultLexicalChoice and defaultAnchoring
missingLexEntries :: [TagElem] -> [LexEntry] -> [LexEntry]Source
missingLexEntries ts lexs returns any of the lexical candidates
lexs that were apparently not anchored succesfully.
TODO: it does this by (wrongly) checking for each lexical item
to see if any of the anchored trees in ts have identical
semantics to that lexical item. The better way to do this would
be to throw a subsumption check on top of items reported missing,
because it's possible for the trees to add semantics through
unification.
Selecting candidate lemmas
defaultLexicalChoice :: Lexicon -> Sem -> [LexEntry]Source
Select and returns the set of entries from the lexicon whose semantics subsumes the input semantics.
chooseCandI :: Sem -> [LexEntry] -> [LexEntry]Source
chooseCandI sem l attempts to unify the semantics of l with sem
If this succeeds, we use return the result(s); if it fails, we reject
l as a lexical selection candidate.
mergeSynonyms :: [LexEntry] -> [LexEntry]Source
mergeSynonyms is a factorisation technique that uses
atomic disjunction to merge all synonyms into a single lexical
entry. Two lexical entries are considered synonyms if their
semantics match and they point to the same tree families.
FIXME: 2006-10-11 - note that this is no longer being used, because it breaks the case where two lexical entries differ only by their use of path equations. Perhaps it's worthwhile just to add a check that the path equations match exactly.
Anchoring
type LexCombine a = MaybeT (Writer [LexCombineError]) aSource
The LexCombine monad supports warnings during lexical selection
and also failure via Maybe
lexTell :: LexCombineError -> LexCombine ()Source
Note an anchoring error
defaultAnchoring :: Macros -> [LexEntry] -> Sem -> LexicalSelectionSource
defaultAnchoring schemata lex sem implements the later half of lexical
selection (tree anchoring and enrichement). It assumes that lex consists
just of the lexical items that have been selected, and tries to combine them
with the tree schemata.
This function may be useful if you are implementing your own lexical selection
functions, and you want GenI to take over after you've given it a [LexEntry]
Combination
Arguments
| :: Sem | |
| -> Macros | |
| -> LexEntry | |
| -> ([LexCombineError], [TagElem]) | any warnings, plus the results |
Given a lexical item, looks up the tree families for that item, and anchor the item to the trees.
combineOne :: Sem -> LexEntry -> SchemaTree -> LexCombine [TagElem]Source
Combine a single tree with its lexical item to form a bonafide TagElem. This process can fail, however, because of filtering or enrichement
Enrichment
enrich :: LexEntry -> SchemaTree -> LexCombine SchemaTreeSource
See http://projects.haskell.org/manual/lexical-selection on enrichement
enrich helpers
enrichBy :: SchemaTree -> PathEqPair -> LexCombine SchemaTreeSource
Helper for enrich (enrich by single path equation)
maybeEnrichBy :: SchemaTree -> PathEqPair -> Maybe (SchemaTree, Subst)Source
Helper for enrichBy
enrichFeat :: AvPair GeniVal -> Flist [GeniVal] -> Maybe (Flist [GeniVal], Subst)Source
enrichFeat av fs attempts to unify av with fs
Note here that fs is an Flist [GeniVal] rather than the usual
Flist GeniVal you may expect. This is because it comes from
SchemaTree which allows non-atomic disjunctions of GeniVal
which have to be flatten down to at most atomic disjunctions once
lexical selection is complete.
missingCoanchors :: LexEntry -> SchemaTree -> [Text]Source
missingCoanchors l t returns the list of coanchor node names from l
that were not found in t
lexEquations :: LexEntry -> Writer [LexCombineError] ([AvPair GeniVal], [PathEqPair])Source
Split a lex entry's path equations into interface enrichement equations or (co-)anchor modifiers
seekCoanchor :: NodePathEqLhs -> SchemaTree -> Maybe SchemaNodeSource
seekCoanchor lhs t returns Just node if t contains exactly one
node that can be identified by lhs, Nothing if it contains none.
It crashes if there is more than one such node, because this should have been caught earlier by GenI.
matchNodeName :: NodePathEqLhs -> SchemaNode -> BoolSource
matchNodeName lhs n is True if the lhs refers to the node n
matchNodeNameHelper :: Text -> SchemaNode -> BoolSource
matchNodeNameHelper recognises “anchor“ by convention; otherwise,
it does a name match
Lemanchor mechanism
setLemAnchors :: Tree (GNode GeniVal) -> Tree (GNode GeniVal)Source
The lemanchor mechanism is described in http://projects.haskell.org/manual/lexical-selection
The name of the lemanchor attribute (by convention; see source)