Encode: Encoding character data

[ codec, library, program, text ]

The Encode library provides a unified interface for converting strings from different encodings into a common representation, and vice versa. This representation is isomorphic to the Unicode character set, and the encodings can be both standard and user-defined. For this purpose, the Encode module defines the Encode.UPoint data type and the Encode.Encoding type class with the encode and decode methods.
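
The shape of this interface can be illustrated with a minimal, self-contained sketch. The definitions below are local stand-ins written for this illustration and are only assumed to resemble Encode.UPoint and Encode.Encoding; the signatures in the actual library may differ. The Shift encoding is a hypothetical user-defined toy, not one of the supported schemes.

```haskell
import Data.Char (chr, ord)

-- Hypothetical stand-in for Encode.UPoint: a single Unicode code point.
newtype UPoint = UPoint Int
  deriving (Eq, Ord, Show)

-- Hypothetical stand-in for Encode.Encoding: a value of type 'e' names an
-- encoding and converts between its external form and code points.
class Encoding e where
  encode :: e -> [UPoint] -> String
  decode :: e -> String -> [UPoint]

-- A toy user-defined encoding: every external character is the code point
-- shifted by a fixed offset. Out-of-range results are not handled.
newtype Shift = Shift Int

instance Encoding Shift where
  encode (Shift n) ups = [chr (p + n) | UPoint p <- ups]
  decode (Shift n) str = [UPoint (ord c - n) | c <- str]
```

Under these assumptions, decoding and then encoding with the same Shift value returns the original string, which is the round-trip property one would expect of any instance.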

The Encode library is proposed as a Haskell analogue of the Encode extension in Perl, http://search.cpan.org/dist/Encode/.

The Exec.Encode.Main and Exec.Decode.Main programs mimic the function calls to encode and decode, respectively, with the following usage of the executables:

   decode ArabTeX < decode.d | encode Buckwalter > encode.d

   decode MacArabic < data.MacArabic > data.UTF8

   encode WinArabic < data.UTF8 > data.WinArabic
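
Each of these pipelines decodes input into code points and then encodes the code points into the target scheme. The sketch below shows that composition with plain functions. It is an illustration only: Latin-1 is used as a stand-in source encoding because the MacArabic and WinArabic tables are not reproduced here, and all function names are hypothetical rather than taken from the library.

```haskell
import Data.Bits (shiftR, (.&.), (.|.))
import Data.Char (chr, ord)

-- Code points as plain Ints; a simplification of the common representation.
type CodePoint = Int

-- Stand-in byte-oriented source encoding: in Latin-1 every byte is its own
-- code point.
decodeLatin1 :: String -> [CodePoint]
decodeLatin1 = map ord

-- Encode code points as UTF-8, producing one Char per output byte.
encodeUTF8 :: [CodePoint] -> String
encodeUTF8 = concatMap (map chr . bytes)
  where
    bytes p
      | p < 0x80    = [p]
      | p < 0x800   = [0xC0 .|. (p `shiftR` 6), cont p]
      | p < 0x10000 = [0xE0 .|. (p `shiftR` 12), cont (p `shiftR` 6), cont p]
      | otherwise   = [ 0xF0 .|. (p `shiftR` 18), cont (p `shiftR` 12)
                      , cont (p `shiftR` 6), cont p ]
    -- Continuation byte: 10xxxxxx carrying the low six bits.
    cont x = 0x80 .|. (x .&. 0x3F)

-- Counterpart of a "decode A | encode B" pipeline: decode, then encode.
latin1ToUTF8 :: String -> String
latin1ToUTF8 = encodeUTF8 . decodeLatin1
```

Passing such a function to interact would give a stdin-to-stdout filter, although byte-exact input and output would additionally require binary-mode handles.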

The installation instructions are given in INSTALL. For the list of supported encoding schemes, please refer to Encode and the source files of the programs. With the executables, the naming of encodings is case-insensitive and some further options are allowed:

 encode|decode [--OPTIONS] [ENCODING]
   -h       --help         show usage information
   -l       --lines        use line-oriented mode
   -p text  --prefix=text  prefix input with text
   -s text  --suffix=text  suffix input with text
   -v       --version      show program's version
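
As an illustration of this command-line interface, the option table can be described with System.Console.GetOpt from base. This is a sketch under assumptions: the Flag type, parseArgs, and normaliseName are hypothetical names, and the actual programs may parse their arguments differently.

```haskell
import Data.Char (toLower)
import System.Console.GetOpt

-- Hypothetical flag type; constructor names are illustrative only.
data Flag = Help | Lines | Prefix String | Suffix String | Version
  deriving (Eq, Show)

-- Option descriptions mirroring the usage text above.
options :: [OptDescr Flag]
options =
  [ Option "h" ["help"]    (NoArg Help)           "show usage information"
  , Option "l" ["lines"]   (NoArg Lines)          "use line-oriented mode"
  , Option "p" ["prefix"]  (ReqArg Prefix "text") "prefix input with text"
  , Option "s" ["suffix"]  (ReqArg Suffix "text") "suffix input with text"
  , Option "v" ["version"] (NoArg Version)        "show program's version"
  ]

-- Split the arguments into recognised flags and the remaining words
-- (the encoding name), or return the parser's error messages.
parseArgs :: [String] -> Either [String] ([Flag], [String])
parseArgs args = case getOpt Permute options args of
  (flags, rest, []) -> Right (flags, rest)
  (_, _, errs)      -> Left errs

-- Case-insensitive encoding names: compare them in lower case.
normaliseName :: String -> String
normaliseName = map toLower
```

With these definitions, parseArgs ["-l", "--prefix=abc", "Buckwalter"] separates the two flags from the encoding name, and normaliseName maps "ArabTeX" and "arabtex" to the same lookup key.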

The PureFP library is an edited excerpt from the Functional Parsing library developed by Peter Ljunglöf in his licentiate thesis Pure Functional Parsing – an advanced tutorial, Göteborg University and Chalmers University of Technology, April 2002, http://www.ling.gu.se/~peb/software.html.

The Parsek library is included for experimental purposes and subsumes PureFP.Parsers.Stream. Parsek was developed by Koen Claessen in his functional pearl article Parallel Parsing Processes, Journal of Functional Programming, 14(6), 741–757, Cambridge University Press, 2004, http://www.cs.chalmers.se/~koen/pubs/entry-jfp04-parser.html.

Modules

  • Encode
    • Encode.Arabic
      • Encode.Arabic.ArabTeX
        • Encode.Arabic.ArabTeX.ZDMG
      • Encode.Arabic.Buckwalter
      • Encode.Arabic.Byte
    • Encode.ExPlus
    • Encode.Extend
    • Encode.Mapper
    • Encode.Unicode
      • Encode.Unicode.UTF8
    • Encode.Version
  • Parsek
  • PureFP
    • PureFP.OrdMap
    • PureFP.OrdSet
    • PureFP.Parsers
      • PureFP.Parsers.AmbExTrie
      • PureFP.Parsers.AmbTrie
      • PureFP.Parsers.ExTrie
      • PureFP.Parsers.PairTrie
      • PureFP.Parsers.Parser
      • PureFP.Parsers.Standard
      • PureFP.Parsers.Stream
      • PureFP.Parsers.Trie

Versions: 0.7, 1.1, 1.2, 1.3.1, 1.3.2, 1.3.3, 1.3.4, 1.3.5, 1.3.6, 1.3.7, 1.3.8
Dependencies: base (<4.4), Cabal, containers, mtl
License: LicenseRef-GPL
Copyright: 2011
Author: Otakar Smrz
Maintainer: otakar.smrz mff.cuni.cz
Category: Text, Codec
Home page: http://ufal.mff.cuni.cz/~smrz/
Uploaded: by OtakarSmrz at 2011-08-21T23:08:36Z
Reverse Dependencies: 2 direct, 0 indirect
Executables: decode, encode
Downloads: 9892 total (26 in the last 30 days)
Status: Docs not available. All reported builds failed as of 2016-12-27 (6 reports).