Portability | portable |
---|---|
Stability | experimental |
Maintainer | ekmett@gmail.com |
Safe Haskell | Safe-Inferred |
Provides Unicode general categories, which are typically denoted by \p{Ll} or \p{Modifier_Letter}. Lookups can be constructed using categories, or individual character sets can be used directly.

A lookup that ignores case, underscores (_) and hyphens (-) is provided by lookupCategory and can be used to provide behavior similar to that of Perl or PCRE.
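As a minimal usage sketch of the two approaches described above (name-based lookup and direct use of a character set), the following assumes that this module is Data.CharSet.Unicode.Category from the charset package and that Data.CharSet exports member :: Char -> CharSet -> Bool. The helper inCategory is hypothetical and not part of the library.

```haskell
import qualified Data.CharSet as CharSet
import Data.CharSet.Unicode.Category (lookupCategoryCharSet, lowercaseLetter)

-- | Hypothetical helper (not part of the library): True when the character
-- belongs to the general category with the given name or abbreviation.
-- Unknown category names yield False.
inCategory :: String -> Char -> Bool
inCategory name c = maybe False (CharSet.member c) (lookupCategoryCharSet name)

-- | Direct use of an exported character set, with no name lookup involved.
isLower' :: Char -> Bool
isLower' c = CharSet.member c lowercaseLetter
```

Loaded in GHCi, inCategory "Ll" 'a' and inCategory "lowercase_letter" 'a' should agree with isLower' 'a', since the lookup is described as ignoring case, underscores and hyphens.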
- data Category = Category {}
- categories :: [Category]
- lookupCategory :: String -> Maybe Category
- lookupCategoryCharSet :: String -> Maybe CharSet
- lowercaseLetter, letter, otherLetter, modifierLetter, letterAnd, titlecaseLetter, uppercaseLetter :: CharSet
- nonSpacingMark, mark, enclosingMark, spacingCombiningMark :: CharSet
- space, separator, paragraphSeparator, lineSeparator :: CharSet
- mathSymbol, symbol, otherSymbol, modifierSymbol, currencySymbol :: CharSet
- decimalNumber, number, otherNumber, letterNumber :: CharSet
- dashPunctuation, punctuation, otherPunctuation, connectorPunctuation, finalQuote, initialQuote, closePunctuation, openPunctuation :: CharSet
- control, other, notAssigned, surrogate, privateUse, format :: CharSet
Unicode General Category
Lookup
categories :: [Category]
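The case-, underscore- and hyphen-insensitive matching that the module summary attributes to lookupCategory can be pictured with a small sketch that depends only on base. This is an illustration of the idea, not the library's implementation: normalize, table and lookupPredicate are hypothetical names, and the table covers just two categories.

```haskell
import Data.Char (GeneralCategory (..), generalCategory, toLower)

-- | Drop '_' and '-' and fold case, so that "Modifier_Letter",
-- "modifier-letter" and "MODIFIERLETTER" all normalise to the same key.
normalize :: String -> String
normalize = map toLower . filter (`notElem` "_-")

-- | Hypothetical miniature table: each category is registered under both its
-- abbreviation and its long name, keyed by the normalised spelling.
table :: [(String, Char -> Bool)]
table =
  [ (normalize key, (== cat) . generalCategory)
  | (keys, cat) <- [ (["Ll", "LowercaseLetter"], LowercaseLetter)
                   , (["Lm", "ModifierLetter"], ModifierLetter)
                   ]
  , key <- keys
  ]

-- | Look up a membership predicate by any spelling of a category name.
lookupPredicate :: String -> Maybe (Char -> Bool)
lookupPredicate name = lookup (normalize name) table
```

Normalising both the stored keys and the query means the table needs only one entry per name, whatever spelling the caller uses.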
CharSets by Category
Letter
lowercaseLetter, letter, otherLetter, modifierLetter, letterAnd, titlecaseLetter, uppercaseLetter :: CharSet
Letter&
Mark
Separator
Symbol
Number
Punctuation
dashPunctuation, punctuation, otherPunctuation, connectorPunctuation, finalQuote, initialQuote, closePunctuation, openPunctuation :: CharSet