hexpat-0.20.13: XML parser/formatter based on expat

Safe HaskellNone
LanguageHaskell98

Text.XML.Expat.Extended

Contents

Description

An extended variant of Node intended to implement the entire XML specification. DTDs are not yet supported, however.

The names conflict with those in Tree so you must use qualified import if you want to use both modules.

Synopsis

Tree structure

type Document a tag text = DocumentG a [] tag text Source #

A pure representation of an XML document that uses a list as its container type.

data DocumentG a c tag text Source #

Document representation of the XML document, intended to support the entire XML specification. DTDs are not yet supported, however.

Constructors

Document 

Instances

(Functor c, List c) => DocumentClass (DocumentG ann) c Source # 

Methods

getXMLDeclaration :: DocumentG ann c tag text -> Maybe (XMLDeclaration text) Source #

getDocumentTypeDeclaration :: DocumentG ann c tag text -> Maybe (DocumentTypeDeclaration c tag text) Source #

getTopLevelMiscs :: DocumentG ann c tag text -> c (Misc text) Source #

getRoot :: DocumentG ann c tag text -> NodeType (DocumentG ann) c tag text Source #

mkDocument :: Maybe (XMLDeclaration text) -> Maybe (DocumentTypeDeclaration c tag text) -> c (Misc text) -> NodeType (DocumentG ann) c tag text -> DocumentG ann c tag text Source #

(Eq tag, Eq text, Eq a) => Eq (DocumentG a [] tag text) Source # 

Methods

(==) :: DocumentG a [] tag text -> DocumentG a [] tag text -> Bool #

(/=) :: DocumentG a [] tag text -> DocumentG a [] tag text -> Bool #

(Show tag, Show text, Show a) => Show (DocumentG a [] tag text) Source # 

Methods

showsPrec :: Int -> DocumentG a [] tag text -> ShowS #

show :: DocumentG a [] tag text -> String #

showList :: [DocumentG a [] tag text] -> ShowS #

type NodeType (DocumentG ann) Source # 
type NodeType (DocumentG ann) = NodeG ann

type Node a tag text = NodeG a [] tag text Source #

A pure tree representation that uses a list as its container type, extended variant.

In the hexpat package, a list of nodes has the type [Node tag text], but note that you can also use the more general type function ListOf to give a list of any node type, using that node's associated list type, e.g. ListOf (UNode Text).

data NodeG a c tag text Source #

Extended variant of the tree representation of the XML document, intended to support the entire XML specification. DTDs are not yet supported, however.

c is the container type for the element's children, which is [] in the hexpat package, and a monadic list type for hexpat-iteratee.

tag is the tag type, which can either be one of several string types, or a special type from the Text.XML.Expat.Namespaced or Text.XML.Expat.Qualified modules.

text is the string type for text content.

a is the type of the annotation. One of the things this can be used for is to store the XML parse location, which is useful for error handling.

Note that some functions in the Text.XML.Expat.Cursor module need to create new nodes through the MkElementClass type class. Normally this can only be done if a is a Maybe type or () (so it can provide the Nothing value for the annotation on newly created nodes). Or, you can write your own MkElementClass instance. Apart from that, there is no requirement for a to be a Maybe type.

Constructors

Element 

Fields

Text !text 
CData !text 
Misc (Misc text) 

Instances

(Functor c, List c) => MkElementClass (NodeG (Maybe a)) c Source # 

Methods

mkElement :: tag -> Attributes tag text -> c (NodeG (Maybe a) c tag text) -> NodeG (Maybe a) c tag text Source #

(Functor c, List c) => MkElementClass (NodeG ()) c Source # 

Methods

mkElement :: tag -> Attributes tag text -> c (NodeG () c tag text) -> NodeG () c tag text Source #

(Functor c, List c) => NodeClass (NodeG a) c Source # 

Methods

isElement :: NodeG a c tag text -> Bool Source #

isText :: NodeG a c tag text -> Bool Source #

isCData :: NodeG a c tag text -> Bool Source #

isProcessingInstruction :: NodeG a c tag text -> Bool Source #

isComment :: NodeG a c tag text -> Bool Source #

textContentM :: Monoid text => NodeG a c tag text -> ItemM c text Source #

isNamed :: Eq tag => tag -> NodeG a c tag text -> Bool Source #

getName :: Monoid tag => NodeG a c tag text -> tag Source #

hasTarget :: Eq text => text -> NodeG a c tag text -> Bool Source #

getTarget :: Monoid text => NodeG a c tag text -> text Source #

getAttributes :: NodeG a c tag text -> [(tag, text)] Source #

getChildren :: NodeG a c tag text -> c (NodeG a c tag text) Source #

getText :: Monoid text => NodeG a c tag text -> text Source #

modifyName :: (tag -> tag) -> NodeG a c tag text -> NodeG a c tag text Source #

modifyAttributes :: ([(tag, text)] -> [(tag, text)]) -> NodeG a c tag text -> NodeG a c tag text Source #

modifyChildren :: (c (NodeG a c tag text) -> c (NodeG a c tag text)) -> NodeG a c tag text -> NodeG a c tag text Source #

modifyElement :: ((tag, [(tag, text)], c (NodeG a c tag text)) -> (tag', [(tag', text)], c (NodeG a c tag' text))) -> NodeG a c tag text -> NodeG a c tag' text Source #

mapAllTags :: (tag -> tag') -> NodeG a c tag text -> NodeG a c tag' text Source #

mapNodeContainer :: List c' => (forall b. c b -> ItemM c (c' b)) -> NodeG a c tag text -> ItemM c (NodeG a c' tag text) Source #

mkText :: text -> NodeG a c tag text Source #

(Eq tag, Eq text, Eq a) => Eq (NodeG a [] tag text) Source # 

Methods

(==) :: NodeG a [] tag text -> NodeG a [] tag text -> Bool #

(/=) :: NodeG a [] tag text -> NodeG a [] tag text -> Bool #

(Show tag, Show text, Show a) => Show (NodeG a [] tag text) Source # 

Methods

showsPrec :: Int -> NodeG a [] tag text -> ShowS #

show :: NodeG a [] tag text -> String #

showList :: [NodeG a [] tag text] -> ShowS #

(NFData tag, NFData text, NFData a) => NFData (NodeG a [] tag text) Source # 

Methods

rnf :: NodeG a [] tag text -> () #

type ListOf (NodeG a c tag text) Source # 
type ListOf (NodeG a c tag text) = c (NodeG a c tag text)

type UDocument a text = Document a text text Source #

Type alias for an extended document with unqualified tag names where tag and text are the same string type

type LDocument tag text = Document XMLParseLocation tag text Source #

Type alias for an extended document, annotated with parse location

type ULDocument text = Document XMLParseLocation text text Source #

Type alias for an extended document with unqualified tag names where tag and text are the same string type, annotated with parse location

type UNode a text = Node a text text Source #

Type alias for an extended node with unqualified tag names where tag and text are the same string type

type LNode tag text = Node XMLParseLocation tag text Source #

Type alias for an extended node, annotated with parse location

type ULNode text = LNode text text Source #

Type alias for an extended node with unqualified tag names where tag and text are the same string type, annotated with parse location

Generic document/node manipulation

Annotation-specific

modifyAnnotation :: (a -> a) -> Node a tag text -> Node a tag text Source #

Modify this node's annotation (non-recursively) if it's an element, otherwise no-op.

mapAnnotation :: (a -> b) -> Node a tag text -> Node b tag text Source #

Modify this node's annotation and all its children recursively if it's an element, otherwise no-op.

mapDocumentAnnotation :: (a -> b) -> Document a tag text -> Document b tag text Source #

Modify the annotation of every node in the document recursively.

Qualified nodes

type QDocument a text = Document a (QName text) text Source #

Type alias for an extended document where qualified names are used for tags

type QLDocument text = Document XMLParseLocation (QName text) text Source #

Type alias for an extended document where qualified names are used for tags, annotated with parse location

type QNode a text = Node a (QName text) text Source #

Type alias for an extended node where qualified names are used for tags

type QLNode text = LNode (QName text) text Source #

Type alias for an extended node where qualified names are used for tags, annotated with parse location

Namespaced nodes

type NDocument a text = Document a (NName text) text Source #

Type alias for an extended document where namespaced names are used for tags

type NLDocument text = Document XMLParseLocation (NName text) text Source #

Type alias for an extended document where namespaced names are used for tags, annotated with parse location

type NNode a text = Node a (NName text) text Source #

Type alias for an extended node where namespaced names are used for tags

type NLNode text = LNode (NName text) text Source #

Type alias for an extended node where namespaced names are used for tags, annotated with parse location

Parse to tree

data ParseOptions tag text Source #

Constructors

ParseOptions 

Fields

  • overrideEncoding :: Maybe Encoding

    The encoding parameter, if provided, overrides the document's encoding declaration.

  • entityDecoder :: Maybe (tag -> Maybe text)

    If provided, entity references (i.e.   and friends) will be decoded into text using the supplied lookup function

data Encoding Source #

Constructors

ASCII 
UTF8 
UTF16 
ISO88591 

parse Source #

Arguments

:: (GenericXMLString tag, GenericXMLString text) 
=> ParseOptions tag text

Parse options

-> ByteString

Input text (a lazy ByteString)

-> (LDocument tag text, Maybe XMLParseError) 

Lazily parse XML to tree. Note that forcing the XMLParseError return value will force the entire parse. Therefore, to ensure lazy operation, don't check the error status until you have processed the tree.

parse' Source #

Arguments

:: (GenericXMLString tag, GenericXMLString text) 
=> ParseOptions tag text

Parse options

-> ByteString

Input text (a strict ByteString)

-> Either XMLParseError (LDocument tag text) 

Strictly parse XML to tree. Returns error message or valid parsed tree.

data XMLParseError Source #

Parse error, consisting of message text and error location

data XMLParseLocation Source #

Specifies a location of an event within the input text

Constructors

XMLParseLocation 

Fields

Variant that throws exceptions

parseThrowing Source #

Arguments

:: (GenericXMLString tag, GenericXMLString text) 
=> ParseOptions tag text

Parse options

-> ByteString

Input text (a lazy ByteString)

-> LDocument tag text 

Lazily parse XML to tree. In the event of an error, throw XMLParseException.

parseThrowing can throw an exception from pure code, which is generally a bad way to handle errors, because Haskell's lazy evaluation means it's hard to predict where it will be thrown from. However, it may be acceptable in situations where it's not expected during normal operation, depending on the design of your program.

Convert from SAX

saxToTree :: (GenericXMLString tag, Monoid text) => [(SAXEvent tag text, a)] -> (Document a tag text, Maybe XMLParseError) Source #

A lower level function that lazily converts a SAX stream into a tree structure. Variant that takes annotations for start tags.

Abstraction of string types

class (Monoid s, Eq s) => GenericXMLString s where Source #

An abstraction for any string type you want to use as xml text (that is, attribute values or element text content). If you want to use a new string type with hexpat, you must make it an instance of GenericXMLString.