unicode-transforms: Unicode transforms (normalization NFC/NFD/NFKC/NFKD)

[ bsd3, data, library, text, unicode ] [ Propose Tags ]

This is a lightweight library supporting a limited set of unicode transformations (only normalizations as of now) on ByteStrings (UTF-8) and Text without requiring any other system libraries. It is based on the utf8proc C utility supporting unicode versions upto 5.1.0.

text-icu is a full featured alternative for all unicode operations but with a dependency on the system installed icu libraries. This package aims to provide an API similar to text-icu.

For more details see the README.md file.


[Skip to Readme]
Versions [faq] 0.1.0.1, 0.2.0, 0.2.1, 0.3.0, 0.3.1, 0.3.2, 0.3.3, 0.3.4, 0.3.5
Dependencies base (>=4.7 && <5), bytestring, text [details]
License BSD-3-Clause
Copyright 2016 Harendra Kumar
Author Harendra Kumar
Maintainer harendra.kumar@gmail.com
Category Data, Text, Unicode
Home page http://github.com/harendra-kumar/unicode-transforms
Source repo head: git clone https://github.com/harendra-kumar/unicode-transforms
Uploaded by harendra at Mon Jun 20 16:06:49 UTC 2016
Distributions Arch:0.3.5, Debian:0.3.4, LTSHaskell:0.3.5, NixOS:0.3.5, Stackage:0.3.5, openSUSE:0.3.5
Downloads 7064 total (457 in the last 30 days)
Rating (no votes yet) [estimated by rule of succession]
Your Rating
  • λ
  • λ
  • λ
Status Hackage Matrix CI
Docs available [build log]
Last success reported on 2016-06-20 [all 1 reports]

Modules

[Index]

Flags

NameDescriptionDefaultType
bench-icu

Use text-icu for benchmark comparison

DisabledManual

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info

Downloads

Maintainer's Corner

For package maintainers and hackage trustees


Readme for unicode-transforms-0.1.0.1

[back to package description]

Unicode Transforms

This is a lightweight Haskell library supporting commonly used unicode transformations (currently only normalizations) on ByteStrings (UTF-8) and Text.

Haskell package text-icu provides a comprehensive set of unicode transforms. The drawback of text-icu is that it requires you to install the ICU library OS packages first. This package is self contained and aims to provide an API similar to text-icu so that it can be used as a drop-in replacement for the features it supports.

Features

Unicode normalization in NFC, NFKC, NFD, NFKD forms is supported. This version of the library supports unicode versions upto 5.1.0.

Documentation

Please see the haddock documentation available with the package.

Implementation

This package is implemented as bindings to the utf8proc C utility. The utf8proc version bundled with this package is taken from the xqilla project (xqilla version 2.3.2).

In future the underlying utf8proc implementation will get replaced by a Haskell implementation supporting the latest unicode versions.

Related stuff

Please see the NOTES.md file shipped with this package for more details on related packages, missing features and todo etc.

Contributing

Contributions are welcome! Please use the github repository at https://github.com/harendra-kumar/unicode-transforms to raise issues, request features or send pull requests.