spelling-suggest: Spelling suggestion tool with library and command-line interfaces.

[ bsd3, console, library, program, text ] [ Propose Tags ]

"thimk" (an old joke) is a command-line spelling word suggestion tool. You give it a possibly-misspelled word, and it spits out one or more properly-spelled words in order of likelihood of similarity.

This functionality is also exported as a library via Text.SpellingSuggest (suggest)

There is an optional precompiled SQlite database of phonetic codes for the entire dictionary, created with "thimk-makedb". This greatly speeds lookup, permitting reasonable performance on enormous dictionaries.

[Skip to Readme]

Modules

[Index]

Text
- Text.SpellingSuggest

Flags

Automatic Flags

Name	Description	Default
debug		Enabled

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info

Downloads

spelling-suggest-0.5.1.tar.gz [browse] (Cabal source package)
Package description (as included in the package)

Maintainer's Corner

Package maintainers

BartonMassey, GregWeber

For package maintainers and hackage trustees

edit package information

Candidates

No Candidates

Versions [RSS]	0.5.0, 0.5.0.1, 0.5.1, 0.5.1.0, 0.5.2.0, 0.5.2.1
Dependencies	base (>=4.2 && <5), edit-distance (>=0.1 && <0.3), parseargs (>=0.1.1 && <0.2), phonetic-code (>=0.1 && <0.2), sqlite (>=0.5.1 && <0.6) [details]
License	BSD-3-Clause
Copyright	Copyright © 2010 Greg Weber and Bart Massey
Author	Greg Weber and Bart Massey
Maintainer	bart@cs.pdx.edu, greg@gregweber.info
Category	Console, Text
Home page	https://github.com/BartMassey/haskell-spell-suggest
Source repo	head: git clone git://github.com/BartMassey/haskell-spell-suggest.git this: git clone git://github.com/BartMassey/haskell-spell-suggest.git(tag v0.5.1)
Uploaded	by BartonMassey at 2012-08-26T18:19:52Z
Distributions
Reverse Dependencies	1 direct, 0 indirect [details]
Executables	thimk-makedb, thimk
Downloads	4377 total (13 in the last 30 days)
Rating	(no votes yet) [estimated by Bayesian average]
Your Rating	λ λ λ
Status	Docs uploaded by user Build status unknown [no reports yet]

Readme for spelling-suggest-0.5.1

[back to package description]

Spelling word suggestion tool
Copyright © 2008 Bart Massey
ALL RIGHTS RESERVED

This software is licensed under the "3-clause ('new')
BSD License".  Please see the file COPYING provided with
this distribution for license terms.

"thimk" (an old joke) is a command-line spelling word
suggestion tool.  You give it a possibly-misspelled word,
and it spits out one or more properly-spelled words in order
of likelihood of similarity.

The idea and name for thimk came from an old program that used to hang
around Reed College, probably written by Graham Ross and
now apparently lost in the mists of time.
See <http://groups.google.com/group/net.sources/msg/8856593862fe22bd>
for the one very vague reference I've found on the web (in the
SEE ALSO section of the referenced manpage).

The current implementation is a bit more sophisticated
than I recall the original being. By
default it uses a prefilter that discards words with
large edit distances from the target, then filters words
with a different phonetic code than the target, then
presents the top result sorted by edit distance.

The Soundex and Phonix phonetic codes are designed for
names, but seem to work about the same with other words.
I follow the common practice of not truncating the codes
for greater precision, although Phonix does truncate its
final "sound" for greater recall.

The latest change to the implementation is an addition
of an optional precompiled SQlite database of phonetic
codes for the entire dictionary, created with
"thimk-makedb".  This greatly speeds lookup, permitting
reasonable performance on enormous dictionaries.