text-1.1.0.0: An efficient packed Unicode text type.

PortabilityGHC
Stabilityexperimental
Maintainerbos@serpentine.com
Safe HaskellNone

Data.Text.Internal.Fusion.Common

Contents

Description

Warning: this is an internal module, and does not have a stable API or name. Functions in this module may not check or enforce preconditions expected by public modules. Use at your own risk!

Common stream fusion functionality for text.

Synopsis

Creation and elimination

streamCString# :: Addr# -> Stream CharSource

Stream the UTF-8-like packed encoding used by GHC to represent constant strings in generated code.

This encoding uses the byte sequence xc0x80 to represent NUL, and the string is NUL-terminated.

Basic interface

cons :: Char -> Stream Char -> Stream CharSource

O(n) Adds a character to the front of a Stream Char.

snoc :: Stream Char -> Char -> Stream CharSource

O(n) Adds a character to the end of a stream.

append :: Stream Char -> Stream Char -> Stream CharSource

O(n) Appends one Stream to the other.

head :: Stream Char -> CharSource

O(1) Returns the first character of a Text, which must be non-empty. Subject to array fusion.

uncons :: Stream Char -> Maybe (Char, Stream Char)Source

O(1) Returns the first character and remainder of a 'Stream Char', or Nothing if empty. Subject to array fusion.

last :: Stream Char -> CharSource

O(n) Returns the last character of a 'Stream Char', which must be non-empty.

tail :: Stream Char -> Stream CharSource

O(1) Returns all characters after the head of a Stream Char, which must be non-empty.

init :: Stream Char -> Stream CharSource

O(1) Returns all but the last character of a Stream Char, which must be non-empty.

null :: Stream Char -> BoolSource

O(1) Tests whether a Stream Char is empty or not.

lengthI :: Integral a => Stream Char -> aSource

O(n) Returns the number of characters in a string.

compareLengthI :: Integral a => Stream Char -> a -> OrderingSource

O(n) Compares the count of characters in a string to a number. Subject to fusion.

This function gives the same answer as comparing against the result of lengthI, but can short circuit if the count of characters is greater than the number, and hence be more efficient.

isSingleton :: Stream Char -> BoolSource

O(n) Indicate whether a string contains exactly one element.

Transformations

map :: (Char -> Char) -> Stream Char -> Stream CharSource

O(n) map f xs is the Stream Char obtained by applying f to each element of xs.

intersperse :: Char -> Stream Char -> Stream CharSource

O(n) Take a character and place it between each of the characters of a 'Stream Char'.

Case conversion

With Unicode text, it is incorrect to use combinators like map toUpper to case convert each character of a string individually. Instead, use the whole-string case conversion functions from this module. For correctness in different writing systems, these functions may map one input character to two or three output characters.

toCaseFold :: Stream Char -> Stream CharSource

O(n) Convert a string to folded case. This function is mainly useful for performing caseless (or case insensitive) string comparisons.

A string x is a caseless match for a string y if and only if:

toCaseFold x == toCaseFold y

The result string may be longer than the input string, and may differ from applying toLower to the input string. For instance, the Armenian small ligature men now (U+FB13) is case folded to the bigram men now (U+0574 U+0576), while the micro sign (U+00B5) is case folded to the Greek small letter letter mu (U+03BC) instead of itself.

toLower :: Stream Char -> Stream CharSource

O(n) Convert a string to lower case, using simple case conversion. The result string may be longer than the input string. For instance, the Latin capital letter I with dot above (U+0130) maps to the sequence Latin small letter i (U+0069) followed by combining dot above (U+0307).

toTitle :: Stream Char -> Stream CharSource

O(n) Convert a string to title case, using simple case conversion.

The first letter of the input is converted to title case, as is every subsequent letter that immediately follows a non-letter. Every letter that immediately follows another letter is converted to lower case.

The result string may be longer than the input string. For example, the Latin small ligature fl (U+FB02) is converted to the sequence Latin capital letter F (U+0046) followed by Latin small letter l (U+006C).

Note: this function does not take language or culture specific rules into account. For instance, in English, different style guides disagree on whether the book name "The Hill of the Red Fox" is correctly title cased—but this function will capitalize every word.

toUpper :: Stream Char -> Stream CharSource

O(n) Convert a string to upper case, using simple case conversion. The result string may be longer than the input string. For instance, the German eszett (U+00DF) maps to the two-letter sequence SS.

Justification

Folds

foldl :: (b -> Char -> b) -> b -> Stream Char -> bSource

foldl, applied to a binary operator, a starting value (typically the left-identity of the operator), and a Stream, reduces the Stream using the binary operator, from left to right.

foldl' :: (b -> Char -> b) -> b -> Stream Char -> bSource

A strict version of foldl.

foldl1 :: (Char -> Char -> Char) -> Stream Char -> CharSource

foldl1 is a variant of foldl that has no starting value argument, and thus must be applied to non-empty Streams.

foldl1' :: (Char -> Char -> Char) -> Stream Char -> CharSource

A strict version of foldl1.

foldr :: (Char -> b -> b) -> b -> Stream Char -> bSource

foldr, applied to a binary operator, a starting value (typically the right-identity of the operator), and a stream, reduces the stream using the binary operator, from right to left.

foldr1 :: (Char -> Char -> Char) -> Stream Char -> CharSource

foldr1 is a variant of foldr that has no starting value argument, and thus must be applied to non-empty streams. Subject to array fusion.

Special folds

concat :: [Stream Char] -> Stream CharSource

O(n) Concatenate a list of streams. Subject to array fusion.

concatMap :: (Char -> Stream Char) -> Stream Char -> Stream CharSource

Map a function over a stream that results in a stream and concatenate the results.

any :: (Char -> Bool) -> Stream Char -> BoolSource

O(n) any p xs determines if any character in the stream xs satisifes the predicate p.

all :: (Char -> Bool) -> Stream Char -> BoolSource

O(n) all p xs determines if all characters in the Text xs satisify the predicate p.

maximum :: Stream Char -> CharSource

O(n) maximum returns the maximum value from a stream, which must be non-empty.

minimum :: Stream Char -> CharSource

O(n) minimum returns the minimum value from a Text, which must be non-empty.

Construction

Scans

Accumulating maps

Generation and unfolding

unfoldr :: (a -> Maybe (Char, a)) -> a -> Stream CharSource

O(n), where n is the length of the result. The unfoldr function is analogous to the List unfoldr. unfoldr builds a stream from a seed value. The function takes the element and returns Nothing if it is done producing the stream or returns Just (a,b), in which case, a is the next Char in the string, and b is the seed value for further production.

unfoldrNI :: Integral a => a -> (b -> Maybe (Char, b)) -> b -> Stream CharSource

O(n) Like unfoldr, unfoldrNI builds a stream from a seed value. However, the length of the result is limited by the first argument to unfoldrNI. This function is more efficient than unfoldr when the length of the result is known.

Substrings

Breaking strings

take :: Integral a => a -> Stream Char -> Stream CharSource

O(n) take n, applied to a stream, returns the prefix of the stream of length n, or the stream itself if n is greater than the length of the stream.

drop :: Integral a => a -> Stream Char -> Stream CharSource

O(n) drop n, applied to a stream, returns the suffix of the stream after the first n characters, or the empty stream if n is greater than the length of the stream.

takeWhile :: (Char -> Bool) -> Stream Char -> Stream CharSource

takeWhile, applied to a predicate p and a stream, returns the longest prefix (possibly empty) of elements that satisfy p.

dropWhile :: (Char -> Bool) -> Stream Char -> Stream CharSource

dropWhile p xs returns the suffix remaining after takeWhile p xs.

Predicates

isPrefixOf :: Eq a => Stream a -> Stream a -> BoolSource

O(n) The isPrefixOf function takes two Streams and returns True iff the first is a prefix of the second.

Searching

elem :: Char -> Stream Char -> BoolSource

O(n) elem is the stream membership predicate.

filter :: (Char -> Bool) -> Stream Char -> Stream CharSource

O(n) filter, applied to a predicate and a stream, returns a stream containing those characters that satisfy the predicate.

Indexing

findBy :: (Char -> Bool) -> Stream Char -> Maybe CharSource

O(n) The findBy function takes a predicate and a stream, and returns the first element in matching the predicate, or Nothing if there is no such element.

indexI :: Integral a => Stream Char -> a -> CharSource

O(n) Stream index (subscript) operator, starting from 0.

findIndexI :: Integral a => (Char -> Bool) -> Stream Char -> Maybe aSource

The findIndexI function takes a predicate and a stream and returns the index of the first element in the stream satisfying the predicate.

countCharI :: Integral a => Char -> Stream Char -> aSource

O(n) The countCharI function returns the number of times the query element appears in the given stream.

Zipping and unzipping

zipWith :: (a -> a -> b) -> Stream a -> Stream a -> Stream bSource

zipWith generalises zip by zipping with the function given as the first argument, instead of a tupling function.