Safe Haskell	None
Language	Haskell2010

Text.XML.Light.Extractors

Contents

Errors
Element extraction
Contents extraction

Description

A library for making extraction of information from parsed XML easier.

Example

Suppose you have an xml file of books like this:

<?xml version="1.0"?>
<library>
  <book id="1" isbn="23234-1">
    <author>John Doe</author>
    <title>Some book</title>
  </book>
  <book id="2">
    <author>You</author>
    <title>The Great Event</title>
  </book>
  ...
</library>

And a data type for a book:

data Book = Book { bookId        :: Int
                 , isbn          :: Maybe String
                 , author, title :: String
                 }

You can parse the xml file into a generic tree structure using parseXMLDoc from the xml package.

Using this library one can define extractors to extract data from the generic tree.

   library = element "library" $ children $ only $ many book

   book = element "book" $ do
            i <- attribAs "id" integer
            s <- optional (attrib "isbn")
            children $ do
              a <- element "author" $ contents $ text
              t <- element "title" $ contents $ text
              return $ Book { bookId = i, author = a, title = t, isbn = s }

   extractLibrary :: Element -> Either ExtractionErr [Book]
   extractLibrary = extractDocContents library

Notes

The Control.Applicative module contains some useful combinators like optional, many and <|>.
The Text.XML.Light.Extractors.ShowErr contains some predefined functions to convert error values to strings.
The Text.XML.Light.Extractors.Extra module provides some functions to read numeric data.

Synopsis

Errors

type Path = [String] Source

Location for some content.

data Err Source

Extraction errors.

Constructors

ErrExpect	Some expected content is missing
Fields expected :: String expected content found :: Content found content
ErrAttr	An expected attribute is missing
Fields expected :: String expected content atElement :: Element element with missing attribute
ErrEnd	Expected end of contents
Fields found :: Content found content
ErrNull	Unexpected end of contents
Fields expected :: String expected content
ErrMsg String