dawg-0.7.0: Directed acyclic word graphs

Safe HaskellNone

Data.DAWG.Static

Contents

Description

The module implements directed acyclic word graphs (DAWGs) internaly represented as minimal acyclic deterministic finite-state automata.

In comparison to Data.DAWG module the automaton implemented here:

  • Keeps all nodes in one array and therefore uses much less memory,
  • When weighed, it can be used to perform static hashing with hash and unHash functions,
  • Doesn't provide insert/delete family of operations.

Synopsis

DAWG type

newtype DAWG a b c Source

DAWG a b c constitutes an automaton with alphabet symbols of type a, node values of type Maybe b and additional transition labels of type c. Root is stored on the first position of the array.

Constructors

DAWG 

Fields

unDAWG :: Vector (Node (Maybe b) c)
 

Query

lookup :: (Unbox c, Enum a) => [a] -> DAWG a b c -> Maybe bSource

Find value associated with the key.

numStates :: DAWG a b c -> IntSource

Number of states in the automaton.

Index

index :: Enum a => [a] -> DAWG a b Weight -> Maybe IntSource

Position in a set of all dictionary entries with respect to the lexicographic order.

byIndex :: Enum a => Int -> DAWG a b Weight -> Maybe [a]Source

Find dictionary entry given its index with respect to the lexicographic order.

Hash

hash :: Enum a => [a] -> DAWG a b Weight -> Maybe IntSource

Perfect hashing function for dictionary entries. A synonym for the index function.

unHash :: Enum a => Int -> DAWG a b Weight -> Maybe [a]Source

Inverse of the hash function and a synonym for the byIndex function.

Construction

empty :: Unbox c => DAWG a b cSource

Empty DAWG.

fromList :: (Enum a, Ord b) => [([a], b)] -> DAWG a b ()Source

Construct DAWG from the list of (word, value) pairs. First a DAWG is created and then it is frozen using the freeze function.

fromListWith :: (Enum a, Ord b) => (b -> b -> b) -> [([a], b)] -> DAWG a b ()Source

Construct DAWG from the list of (word, value) pairs with a combining function. The combining function is applied strictly. First a DAWG is created and then it is frozen using the freeze function.

fromLang :: Enum a => [[a]] -> DAWG a () ()Source

Make DAWG from the list of words. Annotate each word with the () value. First a DAWG is created and then it is frozen using the freeze function.

freeze :: DAWG a b -> DAWG a b ()Source

Construct immutable version of the automaton.

Weight

type Weight = IntSource

Weight of a node corresponds to the number of final states reachable from the node. Weight of an edge is a sum of weights of preceding nodes outgoing from the same parent node.

weigh :: Unbox c => DAWG a b c -> DAWG a b WeightSource

Compute node weights and store corresponding values in transition labels.

Conversion

assocs :: (Enum a, Unbox c) => DAWG a b c -> [([a], b)]Source

Return all key/value pairs in the DAWG in ascending key order.

keys :: (Unbox c, Enum a) => DAWG a b c -> [[a]]Source

Return all keys of the DAWG in ascending order.

elems :: Unbox c => DAWG a b c -> [b]Source

Return all elements of the DAWG in the ascending order of their keys.