Holumbus-Searchengine: A search and indexing engine.

[ data, library, mit, text ] [ Propose Tags ] [ Report a vulnerability ]

The Holumbus-Searchengine library provides a document indexer and crawler to build indexes over document collections as well as a sophisticated query interface for these indexes.


[Skip to Readme]

Modules

[Last Documentation]

  • Control
    • Concurrent
      • Control.Concurrent.MapFold
    • Monad
      • Control.Monad.ReaderStateIO
    • Sequential
      • Control.Sequential.MapFoldBinary
  • Holumbus
    • Holumbus.Crawler
      • Holumbus.Crawler.CacheCore
      • Holumbus.Crawler.Constants
      • Holumbus.Crawler.Core
      • Holumbus.Crawler.Html
      • Holumbus.Crawler.IndexerCore
      • Holumbus.Crawler.Logger
      • Holumbus.Crawler.PdfToText
      • Holumbus.Crawler.RobotTypes
      • Holumbus.Crawler.Robots
      • Holumbus.Crawler.Types
      • Holumbus.Crawler.URIs
      • Holumbus.Crawler.Util
      • Holumbus.Crawler.XmlArrows
    • Data
      • Holumbus.Data.Crunch
      • Holumbus.Data.PrefixTree
        • Holumbus.Data.PrefixTree.Core
        • Holumbus.Data.PrefixTree.FuzzySearch
        • Holumbus.Data.PrefixTree.PrefixSet
        • Holumbus.Data.PrefixTree.Types
    • Index
      • Holumbus.Index.Common
        • Holumbus.Index.Common.BasicTypes
        • Holumbus.Index.Common.DiffList
        • Holumbus.Index.Common.DocId
        • Holumbus.Index.Common.DocIdMap
        • Holumbus.Index.Common.Document
        • Holumbus.Index.Common.LoadStore
        • Holumbus.Index.Common.Occurences
        • Holumbus.Index.Common.RawResult
      • Holumbus.Index.CompactDocuments
      • Holumbus.Index.CompactIndex
      • Holumbus.Index.CompactSmallDocuments
      • Holumbus.Index.Compression
      • Inverted
        • Holumbus.Index.Inverted.CompressedPrefixMem
        • Holumbus.Index.Inverted.PrefixMem
    • Query
      • Holumbus.Query.Fuzzy
      • Holumbus.Query.Intermediate
      • Language
        • Holumbus.Query.Language.Grammar
        • Holumbus.Query.Language.Parser
      • Holumbus.Query.Processor
      • Holumbus.Query.Ranking
      • Holumbus.Query.Result
    • Holumbus.Utility

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

  • No Candidates
Versions [RSS] 1.2.0, 1.2.1, 1.2.2, 1.2.3
Dependencies base (>=4 && <5), binary (>=0.5 && <1), bytestring (>=0.9 && <1), bzlib (>=0.4 && <1), containers (>=0.2 && <1), deepseq (>=1.1 && <2), directory (>=1 && <2), enummapset (<1), filepath (>=1 && <2), hslogger (>=1 && <2), hxt (>=9.1 && <10), hxt-cache (>=9 && <10), hxt-curl (>=9 && <10), hxt-regex-xmlschema (>=9 && <10), hxt-unicode (>=9 && <10), mtl (>=1.1 && <3), network (>=2.1 && <3), parallel (>=3.1 && <4), parsec (>=2.1 && <4), process (>=1 && <2), unix (>=2.3 && <3) [details]
License MIT
Copyright Copyright (c) 2007 - 2012 Uwe Schmidt, Sebastian M. Gauck and Timo B. Huebel
Author Sebastian M. Gauck, Timo B. Huebel, Uwe Schmidt
Maintainer Timo B. Huebel <tbh@holumbus.org>, Uwe Schmidt <uwe@fh-wedel.de>
Category Text, Data
Home page http://holumbus.fh-wedel.de
Uploaded by UweSchmidt at 2012-02-14T13:19:48Z
Distributions
Reverse Dependencies 1 direct, 0 indirect [details]
Downloads 3868 total (18 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs not available [build log]
All reported builds failed as of 2016-12-25 [all 8 reports]

Readme for Holumbus-Searchengine-1.2.0

[back to package description]
This is Holumbus.

Version 0.4

Sebastian M. Schlatt - sms@holumbus.org
Timo B. Hübel - tbh@holumbus.org

http://www.holumbus.org

About
-----

Holumbus is a Haskell library which provides the basic building blocks for
creating powerful indexing and search applications. This includes a framework
for distributed crawling and indexing as well as distributed query processing.

Contents
--------

doc       Detailed information about the internals of Holumbus.
examples  Some example applications and utilities.
source    Source code of the Holumbus core library.
test      Several tests for the Holumbus core library.

Requirements
------------

The Holumbus core library requires at least GHC 6.8.2 and the 
following packages (available via Hackage):

- binary 0.4.1
- bzlib 0.4.0.1
- HDBC 1.1.4
- HDBC-sqlite3 1.1.4.0
- hxt 7.5
- regex-compat 0.71.0.1
- utf8-string 0.2

Installation
------------

A Cabal file is provided, therefore Holumbus can be installed using
the standard Cabal way:

$ runhaskell Setup.hs configure
$ runhaskell Setup.hs build
$ runhaskell Setup.hs install # with root privileges

Documentation
-------------

Documentation is provided through the examples and extensive Haddock 
API documentation available online at http://www.holumbus.org.

Details about the internals of the Holumbus framework are available in
the extensive descriptions of the implementation in the "doc" directory.