name: NaturalLanguageAlphabets version: 0.0.2.0 author: Christian Hoener zu Siederdissen maintainer: choener@bioinf.uni-leipzig.de homepage: https://github.com/choener/NaturalLanguageAlphabets bug-reports: https://github.com/choener/NaturalLanguageAlphabets/issues copyright: Christian Hoener zu Siederdissen, 2014-2015 category: Natural Language Processing license: BSD3 license-file: LICENSE build-type: Simple stability: experimental cabal-version: >= 1.10.0 tested-with: GHC == 7.8.4, GHC == 7.10.2 synopsis: Alphabet and word representations description: Provides different encoding for characters and words in natural language processing. A character will often be encoded as a unicode text string as we deal with multi-symbol characters. . Internal encoding of IMMC symbols are 0-based integers, which allows for the use of unboxed containers. . A very simple unigram-based scoring scheme and DSL to write such schemes are also provided. . extra-source-files: README.md changelog.md scoring/simpleunigram.score flag llvm description: build the benchmark using LLVM default: False manual: True library build-depends: base > 4.7 && < 4.9 , aeson >= 0.8 && < 0.11 , array >= 0.5 && < 0.6 , attoparsec >= 0.10 && < 0.14 , bimaps >= 0.0.0.4 && < 0.0.1 , binary >= 0.7 && < 0.8 , bytestring >= 0.10.4 , cereal >= 0.4 && < 0.5 , cereal-text >= 0.1 && < 0.2 , deepseq >= 1.3 && < 1.5 , file-embed >= 0.0.6 && < 0.0.10 , hashable >= 1.2 && < 1.3 , hashtables >= 1.1 && < 1.3 , intern >= 0.9 && < 0.10 , QuickCheck >= 2.7 && < 2.9 , stringable >= 0.1.2 && < 0.2 , system-filepath >= 0.4.9 && < 0.5 , text >= 0.11 && < 1.3 , text-binary >= 0.1 && < 0.3 , unordered-containers >= 0.2.3 && < 0.3 , vector >= 0.10 && < 0.12 , vector-th-unbox >= 0.2 && < 0.3 exposed-modules: NLP.Alphabet.IMMC NLP.Alphabet.IMMC.Internal NLP.Alphabet.MultiChar NLP.Scoring.SimpleUnigram NLP.Scoring.SimpleUnigram.Default NLP.Scoring.SimpleUnigram.Import default-language: Haskell2010 default-extensions: BangPatterns , DeriveGeneric , DeriveDataTypeable , GeneralizedNewtypeDeriving , MultiParamTypeClasses , OverloadedStrings , RecordWildCards , TemplateHaskell , TypeFamilies ghc-options: -O2 -funbox-strict-fields benchmark BenchmarkNLA build-depends: base , containers , criterion >= 1.0.2 && < 1.1.1 , deepseq , hashtables , mwc-random >= 0.13 && < 0.14 , NaturalLanguageAlphabets , random >= 1.0 && < 1.2 , unordered-containers , vector hs-source-dirs: tests main-is: Benchmark.hs type: exitcode-stdio-1.0 default-language: Haskell2010 default-extensions: BangPatterns , ScopedTypeVariables ghc-options: -O2 -rtsopts -funbox-strict-fields -funfolding-use-threshold1000 -funfolding-keeness-factor1000 if flag(llvm) ghc-options: -fllvm -optlo-O3 -optlo-std-compile-opts -fllvm-tbaa test-suite properties type: exitcode-stdio-1.0 main-is: properties.hs ghc-options: -threaded -rtsopts -with-rtsopts=-N hs-source-dirs: tests default-language: Haskell2010 default-extensions: ScopedTypeVariables , TemplateHaskell build-depends: base , aeson , binary , cereal , NaturalLanguageAlphabets , QuickCheck , stringable , test-framework >= 0.8 && < 0.9 , test-framework-quickcheck2 >= 0.3 && < 0.4 , test-framework-th >= 0.2 && < 0.3 , text source-repository head type: git location: git://github.com/choener/NaturalLanguageAlphabets