The snowball package
The Snowball library is used to compute the stems of words in natural languages.
Compared to the older stemmer package, this one:
- Correctly handles Unicode without relying on the system locale
- Takes greater care to avoid memory leaks
- Uses Text rather than String
- Removes the need for stemWords by using rewrite rules to make map stem efficient
- Includes a more recent release of Snowball
- Attempts to comply with the Snowball licensing terms
Although this code was written from scratch, it is heavily modeled on the code of the "stemmer" package.
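The rewrite-rule point above can be illustrated with a small, self-contained sketch. It does not use this package's API: `stem` and `stemMany` below are hypothetical stand-ins (a toy lowercase-and-strip-plural function over String, chosen only to avoid dependencies), and the rule shown is an assumption about the general technique rather than the package's actual source. The idea is that a GHC `RULES` pragma can ask the optimizer to replace `map stem xs` with a batch function, so callers can write the natural `map stem` and need no separate word-list function.

```haskell
module Main (main) where

import Data.Char (toLower)

-- Hypothetical toy stemmer: lowercases a word and drops a trailing 's'
-- from words longer than three letters. Not the Snowball algorithm.
stem :: String -> String
stem w = stripPlural (map toLower w)
  where
    stripPlural s
      | length s > 3, last s == 's' = init s
      | otherwise = s
{-# NOINLINE stem #-}

-- Hypothetical batch variant. In a real binding this is where one-off
-- setup (such as allocating a stemmer) could be shared across all words.
-- It uses explicit recursion rather than 'map stem', so the rule below
-- cannot rewrite this definition into a call to itself.
stemMany :: [String] -> [String]
stemMany []       = []
stemMany (w : ws) = stem w : stemMany ws
{-# NOINLINE stemMany #-}

-- Ask GHC to replace 'map stem xs' with 'stemMany xs' when optimizing.
-- Rules are only applied with optimization enabled (for example -O),
-- and GHC gives no guarantee that a given rule will fire.
{-# RULES "map/stem" forall xs. map stem xs = stemMany xs #-}

main :: IO ()
main = mapM_ putStrLn (map stem ["Running", "cats", "Stems"])
```

Both sides of the rule compute the same result, so the program behaves identically whether or not the rule fires. Only the cost of the call changes.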
Properties
| Versions | 0.1.0, 0.1.0.1, 0.1.0.2, 1.0.0, 1.0.0.1 |
|---|---|
| Dependencies | base (4.*), bytestring, text |
| License | BSD3 |
| Copyright | (c) 2012 Dag Odenhall, (c) 2008 Tupil, (c) 2002 Richard Boulton |
| Maintainer | dag.odenhall@gmail.com |
| Category | Natural Language Processing, Text |
| Home page | http://hub.darcs.net/dag/snowball |
| Source repository | darcs get http://hub.darcs.net/dag/snowball |
| Upload date | Sun Dec 16 18:29:58 UTC 2012 |
| Uploaded by | DagOdenhall |
| Built on | ghc-7.6 |
Modules
- Text
Downloads
- snowball-0.1.0.1.tar.gz (Cabal source package)
- package description (included in the package)