Holumbus-Searchengine-1.2.3: A search and indexing engine.

Safe Haskell: None

Holumbus.Crawler.PdfToText

Documentation

pdfToText :: String -> IO String

Converts PDF data into plain text. The conversion is done by calling the external program pdftotext (shipped in the Linux xpdf package). I/O is done via the ByteString API.
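
The description above can be illustrated with a minimal sketch. It is not the library's actual implementation. The name pdfToTextSketch, the temporary-file handling, the pdftotext flags, and the choice to return an empty string on failure are all assumptions made for illustration. It also assumes that the String argument holds the raw PDF bytes, one byte per character.

```haskell
import           Control.Exception     (IOException, finally, try)
import qualified Data.ByteString.Char8 as C
import           System.Directory      (getTemporaryDirectory, removeFile)
import           System.Exit           (ExitCode (..))
import           System.IO             (hClose, openBinaryTempFile)
import           System.Process        (readProcessWithExitCode)

-- | Hypothetical stand-in for 'pdfToText' (not the Holumbus source).
-- Writes the PDF data to a temporary file via the ByteString API,
-- runs the external program @pdftotext@ on it, and returns the text
-- that the program prints to stdout.
pdfToTextSketch :: String -> IO String
pdfToTextSketch pdfData = do
  tmpDir       <- getTemporaryDirectory
  (pdfPath, h) <- openBinaryTempFile tmpDir "sketch.pdf"
  convert pdfPath h `finally` (hClose h >> removeFile pdfPath)
  where
    convert pdfPath h = do
      -- C.pack keeps only the low 8 bits of each Char, so the input
      -- is assumed to contain one byte per character.
      C.hPut h (C.pack pdfData)
      hClose h
      -- "-q" suppresses messages, "-enc UTF-8" selects the output
      -- encoding, and "-" as the output file means stdout.
      result <- try (readProcessWithExitCode
                       "pdftotext"
                       ["-q", "-enc", "UTF-8", pdfPath, "-"]
                       "")
                  :: IO (Either IOException (ExitCode, String, String))
      return $ case result of
        Right (ExitSuccess, out, _) -> out
        -- Assumption: a failed conversion or a missing pdftotext
        -- binary yields the empty string instead of an error.
        _                           -> ""
```

A caller could read a file with C.readFile, turn it into a String with C.unpack, and pass the result to pdfToTextSketch. Going through a temporary file keeps the sketch independent of whether the installed pdftotext can read its input from stdin.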