snowball: Bindings to the Snowball library.

[ bsd3, library, natural-language-processing, text ]

The Snowball library is used to compute the stems of words in natural languages.

Compared to the older stemmer package, this one:

  • Correctly handles Unicode without relying on the system locale

  • Takes greater care to avoid memory leaks

  • Uses Text rather than String

  • Removes the need for stemWords by using GHC rewrite rules to make map stem efficient

  • Includes a more recent release of Snowball

  • Attempts to comply with the Snowball licensing terms

Although this code is written from scratch, it is heavily modeled on the code of the stemmer package.
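
The rewrite-rule point in the list above can be illustrated with a small, self-contained sketch. It does not use this package's actual code: toyStem and toyStems are hypothetical stand-ins (a crude suffix stripper and its batch variant), and the rule name "map/toyStem" is made up for the example. The idea is only that a RULES pragma lets GHC replace map over a per-word function with a single batch call, so callers can write the obvious code. Whether the rule actually fires depends on compiling with optimization and on phase ordering; the result is the same either way.

```haskell
import           Data.Foldable (asum)
import           Data.Maybe    (fromMaybe)
import           Data.Text     (Text)
import qualified Data.Text     as T

-- Hypothetical toy stemmer: strips the first matching suffix, if any.
-- It only stands in for a real (and more expensive) per-word stemmer call.
toyStem :: Text -> Text
toyStem w = fromMaybe w (asum [T.stripSuffix s w | s <- suffixes])
  where
    suffixes = map T.pack ["ing", "ed", "s"]
-- NOINLINE keeps the name visible so the rule below has a chance to match.
{-# NOINLINE toyStem #-}

-- Hypothetical batch variant. A real binding could allocate one stemmer
-- here and reuse it for every word. It is written with explicit recursion
-- rather than `map toyStem`, so the rule below cannot rewrite this
-- definition into a call to itself.
toyStems :: [Text] -> [Text]
toyStems []       = []
toyStems (w : ws) = toyStem w : toyStems ws
{-# NOINLINE toyStems #-}

-- With optimization enabled, GHC may rewrite `map toyStem ws` into the
-- batch call. GHC may warn that the built-in rules for `map` can fire first.
{-# RULES "map/toyStem" forall ws. map toyStem ws = toyStems ws #-}

main :: IO ()
main = mapM_ (putStrLn . T.unpack) (map toyStem ws)
  where
    ws = map T.pack ["walking", "jumped", "cats", "run"]
```

Because both sides of the rule compute the same list, the program prints the same stems whether or not the rule fires; only the cost of the call differs.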

Versions: 0.1.0, 0.1.0.1, 0.1.0.2, 1.0.0, 1.0.0.1
Dependencies: base (>=4 && <5), bytestring, text
License: BSD-3-Clause
Copyright: (c) 2012 Dag Odenhall, (c) 2008 Tupil, (c) 2002 Richard Boulton
Author:
Maintainer: dag.odenhall@gmail.com
Category: Natural Language Processing, Text
Home page: http://hub.darcs.net/dag/snowball
Source repo: head: darcs get http://hub.darcs.net/dag/snowball
Uploaded: by DagOdenhall at 2012-12-16T18:29:58Z
Distributions: NixOS:1.0.0.1
Reverse Dependencies: 1 direct, 0 indirect
Downloads: 5627 total (18 in the last 30 days)
Rating: (no votes yet) [estimated by Bayesian average]
Status: Docs uploaded by user
Build status: unknown [no reports yet]