datasets: Classical data sets for statistics and machine learning

[ data, data-mining, library, machine-learning, mit, statistics ] [ Propose Tags ]

Classical machine learning and statistics datasets from the UCI Machine Learning Repository and other sources.

The datasets package defines two different kinds of datasets:

import Numeric.Datasets (getDataset)
import Numeric.Datasets.Iris (iris)
import Numeric.Datasets.Abalone (abalone)

main = do
  -- The Iris data set is embedded
  print (length iris)
  print (head iris)
  -- The Abalone dataset is fetched
  abas <- getDataset abalone
  print (length abas)
  print (head abas)
Versions 0.1.0,, 0.2,,,, 0.2.1, 0.2.2, 0.2.3, 0.2.4, 0.2.5
Change log
Dependencies aeson, attoparsec (>=0.13), base (>=4.6 && <5), bytestring, cassava, directory, file-embed, filepath, hashable, microlens, stringsearch, text, time, vector, wreq [details]
License MIT
Author Tom Nielsen <>
Maintainer Tom Nielsen <>
Category Statistics, Machine Learning, Data Mining, Data
Home page
Bug tracker
Source repo head: git clone
Uploaded by glutamate at Mon Jul 31 15:19:16 UTC 2017
Distributions LTSHaskell:0.2.5, NixOS:0.2.5
Downloads 2528 total (33 in the last 30 days)
Rating 2.0 (votes: 1) [estimated by rule of succession]
Your Rating
  • λ
  • λ
  • λ
Status Docs available [build log]
Last success reported on 2017-07-31 [all 1 reports]
Hackage Matrix CI




Maintainer's Corner

For package maintainers and hackage trustees