The Encode package
The Encode library provides a unified interface for converting strings from different encodings into a common representation, and vice versa. This representation is isomorphic to the Unicode character set, and the encodings can be both standard and user-defined. For this purpose, the Encode module defines the Encode.UPoint data type and the Encode.Encoding type class with the encode and decode methods.
The Encode library is being proposed as a Haskell analogy to the Encode extension in Perl, http://search.cpan.org/dist/Encode/.
The Exec.Encode.Main and Exec.Decode.Main programs mimick the function calls to encode and decode, respectively, with the following usage of the executables:
decode ArabTeX < decode.d | encode Buckwalter > encode.d
decode MacArabic < data.MacArabic > data.UTF8
encode WinArabic < data.UTF8 > data.WinArabic
The installation instructions are given in INSTALL. For the list of supported encoding schemes, please refer to Encode and the source files of the programs. With the executables, the naming of encodings is case-insensitive and some further options are allowed:
encode|decode [--OPTIONS] [ENCODING]
-h --help show usage information
-l --lines use line-oriented mode
-p text --prefix=text prefix input with text
-s text --suffix=text suffix input with text
-v --version show program's version
The PureFP library is an edited excerpt from the Functional Parsing library developed by Peter Ljunglöf in his licenciate thesis Pure Functional Parsing – an advanced tutorial, Göteborg University and Chalmers University of Technology, April 2002, http://www.ling.gu.se/~peb/software.html.
The Parsek library is included for experimental purposes and subsumes PureFP.Parsers.Stream. Parsek was developed by Koen Claessen in his functional pearl article Parallel Parsing Processes, Journal of Functional Programming, 14(6), 741757, Cambridge University Press, 2004, http://www.cs.chalmers.se/~koen/pubs/entry-jfp04-parser.html.
Encode PureFP Parsek
Properties
| Versions | 0.7, 1.1, 1.2, 1.3.1, 1.3.2, 1.3.3, 1.3.4, 1.3.5 |
|---|---|
| Dependencies | base (<4.4), Cabal, containers, mtl |
| License | GPL |
| Copyright | 2011 |
| Author | Otakar Smrz |
| Maintainer | otakar.smrz mff.cuni.cz |
| Category | Text, Codec |
| Home page | http://ufal.mff.cuni.cz/~smrz/ |
| Executables | decode, encode |
| Upload date | Sun Aug 21 23:08:36 UTC 2011 |
| Uploaded by | OtakarSmrz |
| Build failure | ghc-7.2 (log), ghc-7.4 (log) |
Modules
- Encode
- Encode.Arabic
- Encode.Arabic.ArabTeX
- Encode.Arabic.ArabTeX.ZDMG
- Encode.Arabic.Buckwalter
- Encode.Arabic.Byte
- Encode.Arabic.ArabTeX
- Encode.ExPlus
- Encode.Extend
- Encode.Mapper
- Encode.Unicode
- Encode.Unicode.UTF8
- Encode.Version
- Encode.Arabic
- Parsek
- PureFP
- PureFP.OrdMap
- PureFP.OrdSet
- PureFP.Parsers
- PureFP.Parsers.AmbExTrie
- PureFP.Parsers.AmbTrie
- PureFP.Parsers.ExTrie
- PureFP.Parsers.PairTrie
- PureFP.Parsers.Parser
- PureFP.Parsers.Standard
- PureFP.Parsers.Stream
- PureFP.Parsers.Trie
Downloads
- Encode-1.3.5.tar.gz (Cabal source package)
- package description (included in the package)