parsers-0.12.5: Parsing combinators

Copyright(c) Edward Kmett 2011
(c) Daan Leijen 1999-2001
LicenseBSD3
Maintainerekmett@gmail.com
Stabilityexperimental
Portabilitynon-portable
Safe HaskellTrustworthy
LanguageHaskell2010

Text.Parser.Token

Contents

Description

Parsers that comprehend whitespace and identifier styles

idStyle    = haskellIdents { styleReserved = ... }
identifier = ident idStyle
reserved   = reserve idStyle

Synopsis

Token Parsing

whiteSpace :: TokenParsing m => m () Source #

Skip zero or more bytes worth of white space. More complex parsers are free to consider comments as white space.

charLiteral :: forall m. TokenParsing m => m Char Source #

This token parser parses a single literal character. Returns the literal character value. This parsers deals correctly with escape sequences. The literal character is parsed according to the grammar rules defined in the Haskell report (which matches most programming languages quite closely).

stringLiteral :: forall m s. (TokenParsing m, IsString s) => m s Source #

This token parser parses a literal string. Returns the literal string value. This parsers deals correctly with escape sequences and gaps. The literal string is parsed according to the grammar rules defined in the Haskell report (which matches most programming languages quite closely).

stringLiteral' :: forall m s. (TokenParsing m, IsString s) => m s Source #

This token parser behaves as stringLiteral, but for single-quoted strings.

natural :: TokenParsing m => m Integer Source #

This token parser parses a natural number (a non-negative whole number). Returns the value of the number. The number can be specified in decimal, hexadecimal or octal. The number is parsed according to the grammar rules in the Haskell report.

integer :: forall m. TokenParsing m => m Integer Source #

This token parser parses an integer (a whole number). This parser is like natural except that it can be prefixed with sign (i.e. '-' or '+'). Returns the value of the number. The number can be specified in decimal, hexadecimal or octal. The number is parsed according to the grammar rules in the Haskell report.

double :: TokenParsing m => m Double Source #

This token parser parses a floating point value. Returns the value of the number. The number is parsed according to the grammar rules defined in the Haskell report.

naturalOrDouble :: TokenParsing m => m (Either Integer Double) Source #

This token parser parses either natural or a float. Returns the value of the number. This parsers deals with any overlap in the grammar rules for naturals and floats. The number is parsed according to the grammar rules defined in the Haskell report.

integerOrDouble :: TokenParsing m => m (Either Integer Double) Source #

This token parser is like naturalOrDouble, but handles leading - or +.

scientific :: TokenParsing m => m Scientific Source #

This token parser parses a floating point value. Returns the value of the number. The number is parsed according to the grammar rules defined in the Haskell report.

naturalOrScientific :: TokenParsing m => m (Either Integer Scientific) Source #

This token parser parses either natural or a scientific. Returns the value of the number. This parsers deals with any overlap in the grammar rules for naturals and floats. The number is parsed according to the grammar rules defined in the Haskell report.

integerOrScientific :: forall m. TokenParsing m => m (Either Integer Scientific) Source #

This token parser is like naturalOrScientific, but handles leading - or +.

symbol :: TokenParsing m => String -> m String Source #

Token parser symbol s parses string s and skips trailing white space.

textSymbol :: TokenParsing m => Text -> m Text Source #

Token parser textSymbol t parses text s and skips trailing white space.

symbolic :: TokenParsing m => Char -> m Char Source #

Token parser symbolic s parses char s and skips trailing white space.

parens :: TokenParsing m => m a -> m a Source #

Token parser parens p parses p enclosed in parenthesis, returning the value of p.

braces :: TokenParsing m => m a -> m a Source #

Token parser braces p parses p enclosed in braces ('{' and '}'), returning the value of p.

angles :: TokenParsing m => m a -> m a Source #

Token parser angles p parses p enclosed in angle brackets ('<' and '>'), returning the value of p.

brackets :: TokenParsing m => m a -> m a Source #

Token parser brackets p parses p enclosed in brackets ('[' and ']'), returning the value of p.

comma :: TokenParsing m => m Char Source #

Token parser comma parses the character ',' and skips any trailing white space. Returns the string ",".

colon :: TokenParsing m => m Char Source #

Token parser colon parses the character ':' and skips any trailing white space. Returns the string ":".

dot :: TokenParsing m => m Char Source #

Token parser dot parses the character '.' and skips any trailing white space. Returns the string ".".

semiSep :: TokenParsing m => m a -> m [a] Source #

Token parser semiSep p parses zero or more occurrences of p separated by semi. Returns a list of values returned by p.

semiSep1 :: TokenParsing m => m a -> m [a] Source #

Token parser semiSep1 p parses one or more occurrences of p separated by semi. Returns a list of values returned by p.

commaSep :: TokenParsing m => m a -> m [a] Source #

Token parser commaSep p parses zero or more occurrences of p separated by comma. Returns a list of values returned by p.

commaSep1 :: TokenParsing m => m a -> m [a] Source #

Token parser commaSep1 p parses one or more occurrences of p separated by comma. Returns a list of values returned by p.

Token Parsing Class

class CharParsing m => TokenParsing m where Source #

Additional functionality that is needed to tokenize input while ignoring whitespace.

Methods

someSpace :: m () Source #

Usually, someSpace consists of one or more occurrences of a space. Some parsers may choose to recognize line comments or block (multi line) comments as white space as well.

nesting :: m a -> m a Source #

Called when we enter a nested pair of symbols. Overloadable to enable disabling layout

semi :: m Char Source #

The token parser |semi| parses the character ';' and skips any trailing white space. Returns the character ';'. Overloadable to permit automatic semicolon insertion or Haskell-style layout.

highlight :: Highlight -> m a -> m a Source #

Tag a region of parsed text with a bit of semantic information. Most parsers won't use this, but it is indispensible for highlighters.

token :: m a -> m a Source #

token p first applies parser p and then the whiteSpace parser, returning the value of p. Every lexical token (token) is defined using token, this way every parse starts at a point without white space. Parsers that use token are called token parsers in this document.

The only point where the whiteSpace parser should be called explicitly is the start of the main parser in order to skip any leading white space.

Alternatively, one might define token as first parsing whiteSpace and then parser p. By parsing whiteSpace first, the parser is able to return before parsing additional whiteSpace, improving laziness.

mainParser  = sum <$ whiteSpace <*> many (token digit) <* eof

Instances

TokenParsing ReadP Source # 
Chunk t => TokenParsing (Parser t) Source # 
TokenParsing m => TokenParsing (Unlined m) Source # 
TokenParsing m => TokenParsing (Unspaced m) Source # 
TokenParsing m => TokenParsing (Unhighlighted m) Source # 
(TokenParsing m, MonadPlus m) => TokenParsing (StateT s m) Source # 

Methods

someSpace :: StateT s m () Source #

nesting :: StateT s m a -> StateT s m a Source #

semi :: StateT s m Char Source #

highlight :: Highlight -> StateT s m a -> StateT s m a Source #

token :: StateT s m a -> StateT s m a Source #

(TokenParsing m, MonadPlus m) => TokenParsing (StateT s m) Source # 

Methods

someSpace :: StateT s m () Source #

nesting :: StateT s m a -> StateT s m a Source #

semi :: StateT s m Char Source #

highlight :: Highlight -> StateT s m a -> StateT s m a Source #

token :: StateT s m a -> StateT s m a Source #

(TokenParsing m, MonadPlus m, Monoid w) => TokenParsing (WriterT w m) Source # 

Methods

someSpace :: WriterT w m () Source #

nesting :: WriterT w m a -> WriterT w m a Source #

semi :: WriterT w m Char Source #

highlight :: Highlight -> WriterT w m a -> WriterT w m a Source #

token :: WriterT w m a -> WriterT w m a Source #

(TokenParsing m, MonadPlus m, Monoid w) => TokenParsing (WriterT w m) Source # 

Methods

someSpace :: WriterT w m () Source #

nesting :: WriterT w m a -> WriterT w m a Source #

semi :: WriterT w m Char Source #

highlight :: Highlight -> WriterT w m a -> WriterT w m a Source #

token :: WriterT w m a -> WriterT w m a Source #

(TokenParsing m, MonadPlus m) => TokenParsing (IdentityT * m) Source # 
(TokenParsing m, MonadPlus m) => TokenParsing (ReaderT * e m) Source # 

Methods

someSpace :: ReaderT * e m () Source #

nesting :: ReaderT * e m a -> ReaderT * e m a Source #

semi :: ReaderT * e m Char Source #

highlight :: Highlight -> ReaderT * e m a -> ReaderT * e m a Source #

token :: ReaderT * e m a -> ReaderT * e m a Source #

Stream s m Char => TokenParsing (ParsecT s u m) Source # 

Methods

someSpace :: ParsecT s u m () Source #

nesting :: ParsecT s u m a -> ParsecT s u m a Source #

semi :: ParsecT s u m Char Source #

highlight :: Highlight -> ParsecT s u m a -> ParsecT s u m a Source #

token :: ParsecT s u m a -> ParsecT s u m a Source #

(TokenParsing m, MonadPlus m, Monoid w) => TokenParsing (RWST r w s m) Source # 

Methods

someSpace :: RWST r w s m () Source #

nesting :: RWST r w s m a -> RWST r w s m a Source #

semi :: RWST r w s m Char Source #

highlight :: Highlight -> RWST r w s m a -> RWST r w s m a Source #

token :: RWST r w s m a -> RWST r w s m a Source #

(TokenParsing m, MonadPlus m, Monoid w) => TokenParsing (RWST r w s m) Source # 

Methods

someSpace :: RWST r w s m () Source #

nesting :: RWST r w s m a -> RWST r w s m a Source #

semi :: RWST r w s m Char Source #

highlight :: Highlight -> RWST r w s m a -> RWST r w s m a Source #

token :: RWST r w s m a -> RWST r w s m a Source #

Token Parsing Transformers

newtype Unspaced m a Source #

This is a parser transformer you can use to disable the automatic trailing space consumption of a Token parser.

Constructors

Unspaced 

Fields

Instances

MonadTrans Unspaced Source # 

Methods

lift :: Monad m => m a -> Unspaced m a #

MonadReader e m => MonadReader e (Unspaced m) Source # 

Methods

ask :: Unspaced m e #

local :: (e -> e) -> Unspaced m a -> Unspaced m a #

reader :: (e -> a) -> Unspaced m a #

MonadState s m => MonadState s (Unspaced m) Source # 

Methods

get :: Unspaced m s #

put :: s -> Unspaced m () #

state :: (s -> (a, s)) -> Unspaced m a #

MonadWriter e m => MonadWriter e (Unspaced m) Source # 

Methods

writer :: (a, e) -> Unspaced m a #

tell :: e -> Unspaced m () #

listen :: Unspaced m a -> Unspaced m (a, e) #

pass :: Unspaced m (a, e -> e) -> Unspaced m a #

Monad m => Monad (Unspaced m) Source # 

Methods

(>>=) :: Unspaced m a -> (a -> Unspaced m b) -> Unspaced m b #

(>>) :: Unspaced m a -> Unspaced m b -> Unspaced m b #

return :: a -> Unspaced m a #

fail :: String -> Unspaced m a #

Functor m => Functor (Unspaced m) Source # 

Methods

fmap :: (a -> b) -> Unspaced m a -> Unspaced m b #

(<$) :: a -> Unspaced m b -> Unspaced m a #

Applicative m => Applicative (Unspaced m) Source # 

Methods

pure :: a -> Unspaced m a #

(<*>) :: Unspaced m (a -> b) -> Unspaced m a -> Unspaced m b #

(*>) :: Unspaced m a -> Unspaced m b -> Unspaced m b #

(<*) :: Unspaced m a -> Unspaced m b -> Unspaced m a #

Alternative m => Alternative (Unspaced m) Source # 

Methods

empty :: Unspaced m a #

(<|>) :: Unspaced m a -> Unspaced m a -> Unspaced m a #

some :: Unspaced m a -> Unspaced m [a] #

many :: Unspaced m a -> Unspaced m [a] #

MonadPlus m => MonadPlus (Unspaced m) Source # 

Methods

mzero :: Unspaced m a #

mplus :: Unspaced m a -> Unspaced m a -> Unspaced m a #

Parsing m => Parsing (Unspaced m) Source # 
CharParsing m => CharParsing (Unspaced m) Source # 
TokenParsing m => TokenParsing (Unspaced m) Source # 

newtype Unlined m a Source #

This is a parser transformer you can use to disable the automatic trailing newline (but not whitespace-in-general) consumption of a Token parser.

Constructors

Unlined 

Fields

Instances

MonadTrans Unlined Source # 

Methods

lift :: Monad m => m a -> Unlined m a #

MonadReader e m => MonadReader e (Unlined m) Source # 

Methods

ask :: Unlined m e #

local :: (e -> e) -> Unlined m a -> Unlined m a #

reader :: (e -> a) -> Unlined m a #

MonadState s m => MonadState s (Unlined m) Source # 

Methods

get :: Unlined m s #

put :: s -> Unlined m () #

state :: (s -> (a, s)) -> Unlined m a #

MonadWriter e m => MonadWriter e (Unlined m) Source # 

Methods

writer :: (a, e) -> Unlined m a #

tell :: e -> Unlined m () #

listen :: Unlined m a -> Unlined m (a, e) #

pass :: Unlined m (a, e -> e) -> Unlined m a #

Monad m => Monad (Unlined m) Source # 

Methods

(>>=) :: Unlined m a -> (a -> Unlined m b) -> Unlined m b #

(>>) :: Unlined m a -> Unlined m b -> Unlined m b #

return :: a -> Unlined m a #

fail :: String -> Unlined m a #

Functor m => Functor (Unlined m) Source # 

Methods

fmap :: (a -> b) -> Unlined m a -> Unlined m b #

(<$) :: a -> Unlined m b -> Unlined m a #

Applicative m => Applicative (Unlined m) Source # 

Methods

pure :: a -> Unlined m a #

(<*>) :: Unlined m (a -> b) -> Unlined m a -> Unlined m b #

(*>) :: Unlined m a -> Unlined m b -> Unlined m b #

(<*) :: Unlined m a -> Unlined m b -> Unlined m a #

Alternative m => Alternative (Unlined m) Source # 

Methods

empty :: Unlined m a #

(<|>) :: Unlined m a -> Unlined m a -> Unlined m a #

some :: Unlined m a -> Unlined m [a] #

many :: Unlined m a -> Unlined m [a] #

MonadPlus m => MonadPlus (Unlined m) Source # 

Methods

mzero :: Unlined m a #

mplus :: Unlined m a -> Unlined m a -> Unlined m a #

Parsing m => Parsing (Unlined m) Source # 

Methods

try :: Unlined m a -> Unlined m a Source #

(<?>) :: Unlined m a -> String -> Unlined m a Source #

skipMany :: Unlined m a -> Unlined m () Source #

skipSome :: Unlined m a -> Unlined m () Source #

unexpected :: String -> Unlined m a Source #

eof :: Unlined m () Source #

notFollowedBy :: Show a => Unlined m a -> Unlined m () Source #

CharParsing m => CharParsing (Unlined m) Source # 
TokenParsing m => TokenParsing (Unlined m) Source # 

newtype Unhighlighted m a Source #

This is a parser transformer you can use to disable syntax highlighting over a range of text you are parsing.

Constructors

Unhighlighted 

Fields

Instances

MonadTrans Unhighlighted Source # 

Methods

lift :: Monad m => m a -> Unhighlighted m a #

MonadReader e m => MonadReader e (Unhighlighted m) Source # 

Methods

ask :: Unhighlighted m e #

local :: (e -> e) -> Unhighlighted m a -> Unhighlighted m a #

reader :: (e -> a) -> Unhighlighted m a #

MonadState s m => MonadState s (Unhighlighted m) Source # 

Methods

get :: Unhighlighted m s #

put :: s -> Unhighlighted m () #

state :: (s -> (a, s)) -> Unhighlighted m a #

MonadWriter e m => MonadWriter e (Unhighlighted m) Source # 

Methods

writer :: (a, e) -> Unhighlighted m a #

tell :: e -> Unhighlighted m () #

listen :: Unhighlighted m a -> Unhighlighted m (a, e) #

pass :: Unhighlighted m (a, e -> e) -> Unhighlighted m a #

Monad m => Monad (Unhighlighted m) Source # 

Methods

(>>=) :: Unhighlighted m a -> (a -> Unhighlighted m b) -> Unhighlighted m b #

(>>) :: Unhighlighted m a -> Unhighlighted m b -> Unhighlighted m b #

return :: a -> Unhighlighted m a #

fail :: String -> Unhighlighted m a #

Functor m => Functor (Unhighlighted m) Source # 

Methods

fmap :: (a -> b) -> Unhighlighted m a -> Unhighlighted m b #

(<$) :: a -> Unhighlighted m b -> Unhighlighted m a #

Applicative m => Applicative (Unhighlighted m) Source # 

Methods

pure :: a -> Unhighlighted m a #

(<*>) :: Unhighlighted m (a -> b) -> Unhighlighted m a -> Unhighlighted m b #

(*>) :: Unhighlighted m a -> Unhighlighted m b -> Unhighlighted m b #

(<*) :: Unhighlighted m a -> Unhighlighted m b -> Unhighlighted m a #

Alternative m => Alternative (Unhighlighted m) Source # 
MonadPlus m => MonadPlus (Unhighlighted m) Source # 
Parsing m => Parsing (Unhighlighted m) Source # 
CharParsing m => CharParsing (Unhighlighted m) Source # 
TokenParsing m => TokenParsing (Unhighlighted m) Source # 

Non-Token Parsers

decimal :: TokenParsing m => m Integer Source #

Parses a non-negative whole number in the decimal system. Returns the value of the number.

This parser does NOT swallow trailing whitespace

hexadecimal :: TokenParsing m => m Integer Source #

Parses a non-negative whole number in the hexadecimal system. The number should be prefixed with "x" or "X". Returns the value of the number.

This parser does NOT swallow trailing whitespace

octal :: TokenParsing m => m Integer Source #

Parses a non-negative whole number in the octal system. The number should be prefixed with "o" or "O". Returns the value of the number.

This parser does NOT swallow trailing whitespace

characterChar :: TokenParsing m => m Char Source #

This parser parses a character literal without the surrounding quotation marks.

This parser does NOT swallow trailing whitespace

integer' :: TokenParsing m => m Integer Source #

This parser parses an integer (a whole number). This parser is like natural except that it can be prefixed with sign (i.e. '-' or '+'). Returns the value of the number. The number can be specified in decimal, hexadecimal or octal. The number is parsed according to the grammar rules in the Haskell report.

This parser does NOT swallow trailing whitespace.

Also, unlike the integer parser, this parser does not admit spaces between the sign and the number.

Identifiers

data IdentifierStyle m Source #

Used to describe an input style for constructors, values, operators, etc.

liftIdentifierStyle :: (MonadTrans t, Monad m) => IdentifierStyle m -> IdentifierStyle (t m) Source #

Lift an identifier style into a monad transformer

Using over from the lens package:

liftIdentifierStyle = over styleChars lift

ident :: (TokenParsing m, Monad m, IsString s) => IdentifierStyle m -> m s Source #

Parse a non-reserved identifier or symbol

reserve :: (TokenParsing m, Monad m) => IdentifierStyle m -> String -> m () Source #

parse a reserved operator or identifier using a given style

reserveText :: (TokenParsing m, Monad m) => IdentifierStyle m -> Text -> m () Source #

parse a reserved operator or identifier using a given style given Text.

Lenses and Traversals

styleName :: Functor f => (String -> f String) -> IdentifierStyle m -> f (IdentifierStyle m) Source #

This lens can be used to update the name for this style of identifier.

styleName :: Lens' (IdentifierStyle m) String

styleStart :: Functor f => (m Char -> f (m Char)) -> IdentifierStyle m -> f (IdentifierStyle m) Source #

This lens can be used to update the action used to recognize the first letter in an identifier.

styleStart :: Lens' (IdentifierStyle m) (m Char)

styleLetter :: Functor f => (m Char -> f (m Char)) -> IdentifierStyle m -> f (IdentifierStyle m) Source #

This lens can be used to update the action used to recognize subsequent letters in an identifier.

styleLetter :: Lens' (IdentifierStyle m) (m Char)

styleChars :: Applicative f => (m Char -> f (n Char)) -> IdentifierStyle m -> f (IdentifierStyle n) Source #

This is a traversal of both actions in contained in an IdentifierStyle.

styleChars :: Traversal (IdentifierStyle m) (IdentifierStyle n) (m Char) (n Char)

styleReserved :: Functor f => (HashSet String -> f (HashSet String)) -> IdentifierStyle m -> f (IdentifierStyle m) Source #

This is a lens that can be used to modify the reserved identifier set.

styleReserved :: Lens' (IdentifierStyle m) (HashSet String)

styleHighlight :: Functor f => (Highlight -> f Highlight) -> IdentifierStyle m -> f (IdentifierStyle m) Source #

This is a lens that can be used to modify the highlight used for this identifier set.

styleHighlight :: Lens' (IdentifierStyle m) Highlight

styleReservedHighlight :: Functor f => (Highlight -> f Highlight) -> IdentifierStyle m -> f (IdentifierStyle m) Source #

This is a lens that can be used to modify the highlight used for reserved identifiers in this identifier set.

styleReservedHighlight :: Lens' (IdentifierStyle m) Highlight

styleHighlights :: Applicative f => (Highlight -> f Highlight) -> IdentifierStyle m -> f (IdentifierStyle m) Source #

This is a traversal that can be used to modify the highlights used for both non-reserved and reserved identifiers in this identifier set.

styleHighlights :: Traversal' (IdentifierStyle m) Highlight