name:           NaturalLanguageAlphabets
version:
author:         Christian Hoener zu Siederdissen
maintainer:
homepage:
copyright:      Christian Hoener zu Siederdissen, 2014-2015
category:       Natural Language Processing
license:        BSD3
license-file:   LICENSE
build-type:     Simple
stability:      experimental
cabal-version:  >= 1.10.0
tested-with:    GHC == 7.8.4, GHC == 7.10.1
synopsis:       Alphabet and word representations
description:
                Provides different encodings for characters and words in natural
                language processing. A character will often be encoded as a
                Unicode text string, as we deal with multi-symbol characters.
                .
                Internally, IMMC symbols are encoded as 0-based integers, which
                allows for the use of unboxed containers.
                .
                A very simple unigram-based scoring scheme is also provided.

extra-source-files:
  scoring/simpleunigram.txt

flag llvm
  description:  build the benchmark using LLVM
  default:      False
  manual:       True

library
  build-depends: base                  >  4.7    && < 4.9
               , array                 >= 0.5    && < 0.6
               , attoparsec            >= 0.10   && < 0.13
               , bimaps                >= 0.0.0  && < 0.0.1
               , bytestring            >= 0.10.4
               , deepseq               >= 1.3    && < 1.5
               , file-embed            >= 0.0.6  && < 0.0.9
               , hashable              >= 1.2    && < 1.3
               , hashtables            >= 1.1    && < 1.3
               , intern                >= 0.9    && < 0.10
               , stringable            >= 0.1.2  && < 0.2
               , system-filepath       >= 0.4.9  && < 0.5
               , text                  >= 0.11   && < 1.3
               , unordered-containers  >= 0.2.3  && < 0.3
               , vector                >= 0.10   && < 0.11
               , vector-th-unbox       >= 0.2    && < 0.3
  exposed-modules:
    NLP.Alphabet.IMMC
    NLP.Alphabet.IMMC.Internal
    NLP.Alphabet.MultiChar
    NLP.Scoring.SimpleUnigram
    NLP.Scoring.SimpleUnigram.Default
    NLP.Scoring.SimpleUnigram.Import
  default-language:
    Haskell2010
  default-extensions: BangPatterns
                    , DeriveGeneric
                    , DeriveDataTypeable
                    , GeneralizedNewtypeDeriving
                    , MultiParamTypeClasses
                    , OverloadedStrings
                    , RecordWildCards
                    , TemplateHaskell
                    , TypeFamilies
  ghc-options:
    -O2 -funbox-strict-fields

benchmark BenchmarkNLA
  build-depends: base
               , containers
               , criterion             >= 1.0.2  && < 1.1.1
               , deepseq
               , hashtables
               , mwc-random            >= 0.13   && < 0.14
               , NaturalLanguageAlphabets
               , random                >= 1.0    && < 1.2
               , unordered-containers
               , vector
  hs-source-dirs:
    src
  main-is:
    Benchmark.hs
  type:
    exitcode-stdio-1.0
  default-language:
    Haskell2010
  default-extensions: BangPatterns
                    , ScopedTypeVariables
  ghc-options:
    -O2
    -rtsopts
    -funbox-strict-fields
    -funfolding-use-threshold1000
    -funfolding-keeness-factor1000
  if flag(llvm)
    ghc-options:
      -fllvm
      -optlo-O3
      -optlo-std-compile-opts
      -fllvm-tbaa

source-repository head
  type: git
  location: git://