full-text-search: In-memory full text search engine

[ bsd3, data, library, search, text ] [ Propose Tags ] [ Report a vulnerability ]

An in-memory full text search engine library. It lets you run full-text queries on a collection of your documents.

Features:

  • Keyword queries and auto-complete/auto-suggest queries.

  • Can search over any type of "document". (You explain how to extract search terms from them.)

  • Supports documents with multiple fields (e.g. title, body)

  • Supports documents with non-term features (e.g. quality score, page rank)

  • Uses the state of the art BM25F ranking function

  • Adjustable ranking parameters (including field weights and non-term feature scores)

  • In-memory but quite compact. It does not keep a copy of your original documents.

  • Quick incremental index updates, making it possible to keep your text search in-sync with your data.

It is independent of the document type, so you have to write the document-specific parts: extracting search terms and any stop words, case-normalisation or stemming. This is quite easy using libraries such as tokenize and snowball.

The source package includes a demo to illustrate how to use the library. The demo is a simplified version of how the library is used in the hackage-server where it provides the backend for the package search feature.

Flags

Automatic Flags
NameDescriptionDefault
build-search-demo

Build a little program illustrating the use of the library

Disabled

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

Versions [RSS] 0.2.0.0, 0.2.1.0, 0.2.1.1, 0.2.1.3, 0.2.1.4, 0.2.2.0, 0.2.2.1, 0.2.2.2 (info)
Change log changelog
Dependencies array (>=0.4 && <0.5), base (>=4.5 && <4.7), bytestring, Cabal (>=1.14 && <1.20), containers (>=0.4 && <0.6), directory, filepath, full-text-search, mtl, snowball (>=1.0 && <1.1), split (>=0.2 && <0.3), tar, text (>=0.11 && <1.2), time, tokenize (>=0.1 && <0.2), transformers, vector (>=0.10 && <0.11) [details]
License BSD-3-Clause
Copyright 2013-2014 Duncan Coutts, 2014 Well-Typed LLP, 2014 IRIS Connect Ltd.
Author Duncan Coutts
Maintainer Duncan Coutts <duncan@well-typed.com>
Category Data, Text, Search
Source repo head: darcs get http://code.haskell.org/full-text-search/
Uploaded by DuncanCoutts at 2014-03-11T13:44:03Z
Distributions NixOS:0.2.2.2
Reverse Dependencies 1 direct, 0 indirect [details]
Executables search-demo
Downloads 5312 total (23 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs available [build log]
Successful builds reported [all 1 reports]