Encode: Encoding character data

[ codec, library, program, text ] [ Propose Tags ]

The Encode library provides a unified interface for converting strings from different encodings into a common representation, and vice versa. This representation is isomorphic to the Unicode character set, and the encodings can be both standard and user-defined. For this purpose, the Encode module defines the Encode.UPoint data type and the Encode.Encoding type class with the encode and decode methods.

The Encode library is being proposed as a Haskell analogy to the Encode extension in Perl, http://search.cpan.org/dist/Encode/.

The Exec.Encode.Main and Exec.Decode.Main programs mimick the fuction calls to encode and decode, respectively, with the following usage of the executables:

   decode ArabTeX < decode.d | encode Buckwalter > encode.d

   decode MacArabic < data.MacArabic > data.UTF8

   encode WinArabic < data.UTF8 > data.WinArabic

The installation instructions are given in INSTALL. For the list of supported encoding schemes, please refer to Encode and the source files of the programs. With the executables, the naming of encodings is case-insensitive and some further options are allowed:

 encode|decode [--OPTIONS] [ENCODING]
   -h       --help         show usage information
   -l       --lines        use line-oriented mode
   -p text  --prefix=text  prefix input with text
   -s text  --suffix=text  suffix input with text
   -v       --version      show program's version

The PureFP library is an edited excerpt from the Functional Parsing library developed by Peter Ljunglöf in his licenciate thesis /Pure Functional Parsing – an advanced tutorial/, Göteborg University and Chalmers University of Technology, April 2002, http://www.ling.gu.se/~peb/software.html.

The Parsek library is included for experimental purposes and subsumes PureFP.Parsers.Stream. Parsek was developed by Koen Claessen in his functional pearl article /Parallel Parsing Processes/, Journal of Functional Programming, 14(6), 741–757, Cambridge University Press, 2004, http://www.cs.chalmers.se/~koen/pubs/entry-jfp04-parser.html

The Version library is just a simple support for working with the CVS/SVN revision keyword.

This software is published under the GNU General Public License. Only the Encode.hs and Version.hs files are instead subject to the Revised BSD License. Note the copyright and license details in the headers of the files, and see LICENSE, LicenseBSD and LicenseGPL distributed with this package.

Encode PureFP Parsek Version

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

  • No Candidates
Versions [RSS] 0.7, 1.1, 1.2, 1.3.1, 1.3.2, 1.3.3, 1.3.4, 1.3.5, 1.3.6, 1.3.7, 1.3.8
Dependencies base (<=4.2), Cabal, mtl [details]
License LicenseRef-GPL
Copyright 2010
Author Otakar Smrz
Maintainer otakar.smrz mff.cuni.cz
Category Text, Codec
Home page http://ufal.mff.cuni.cz/~smrz/
Uploaded by OtakarSmrz at 2010-06-19T23:03:06Z
Distributions
Reverse Dependencies 2 direct, 0 indirect [details]
Executables decode, encode
Downloads 9855 total (25 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs uploaded by user
Build status unknown [no reports yet]