nkjp-0.1.1: Manipulating the National Corpus of Polish (NKJP)

Safe HaskellNone

Text.NKJP.Morphosyntax

Description

Parsing the NKJP morphosyntax layer.

Synopsis

Documentation

parseMorph :: Text -> [Para Text]Source

Parse textual contents of the ann_morphosyntax.xml file.

readMorph :: FilePath -> IO [Para Text]Source

Parse the stand-alone ann_morphosyntax.xml file.

readCorpus :: FilePath -> IO [(FilePath, [Para Text])]Source

Parse NCP the .tar.gz corpus.