Safe Haskell	None
Language	Haskell2010

Numeric.Datasets.Netflix

Contents

Dataset files. The directories are scanned recursively and their contents are presented as (FilePath, ByteString) pairs
Data types
Additional types and helper functions
Netflix dataset parsers
Netflix dataset row type parsers
Attoparsec parser combinators
Attoparsec helpers

Description

Netflix prize dataset

From the README :

The movie rating files contain over 100 million ratings from 480 thousand randomly-chosen, anonymous Netflix customers over 17 thousand movie titles. The data were collected between October, 1998 and December, 2005 and reflect the distribution of all ratings received during this period. The ratings are on a scale from 1 to 5 (integral) stars. To protect customer privacy, each customer id has been replaced with a randomly-assigned id. The date of each rating and the title and year of release for each movie id are also provided.

The competition ended on September, 2009, and the dataset was subsequently removed from the public domain by the company.

We include in this repository a tiny subset of the original dataset for development purposes.

For further information, see http://netflixprize.com/.

Synopsis

Dataset files. The directories are scanned recursively and their contents are presented as (FilePath, ByteString) pairs

trainingSet :: [(FilePath, ByteString)] Source #

testSet :: [(FilePath, ByteString)] Source #

movies :: [(FilePath, ByteString)] Source #

Data types

data RatingDate Source #

Constructors

RatingDate
Fields userId :: UserId ratingDate :: Day

Instances

Eq RatingDate Source #
Methods (==) :: RatingDate -> RatingDate -> Bool # (/=) :: RatingDate -> RatingDate -> Bool #
Show RatingDate Source #
Methods showsPrec :: Int -> RatingDate -> ShowS # show :: RatingDate -> String # showList :: [RatingDate] -> ShowS #

newtype UserId Source #

Training set

Constructors

UserId
Fields unUserId :: Int

Instances

Eq UserId Source #
Methods (==) :: UserId -> UserId -> Bool # (/=) :: UserId -> UserId -> Bool #
Show UserId Source #
Methods showsPrec :: Int -> UserId -> ShowS # show :: UserId -> String # showList :: [UserId] -> ShowS #

data Train Source #

Constructors

Train
Fields trainRating :: RatingDate rating :: Int

Instances

Eq Train Source #
Methods (==) :: Train -> Train -> Bool # (/=) :: Train -> Train -> Bool #
Show Train Source #
Methods showsPrec :: Int -> Train -> ShowS # show :: Train -> String # showList :: [Train] -> ShowS #

newtype MovieId Source #

Movies file

Constructors

MovieId
Fields unMovieId :: Int

Instances

Eq MovieId Source #
Methods (==) :: MovieId -> MovieId -> Bool # (/=) :: MovieId -> MovieId -> Bool #
Show MovieId Source #
Methods showsPrec :: Int -> MovieId -> ShowS # show :: MovieId -> String # showList :: [MovieId] -> ShowS #

data Movie Source #

Constructors

Movie
Fields movieId :: MovieId releaseYear :: Day movieTitle :: ByteString

Instances

Eq Movie Source #
Methods (==) :: Movie -> Movie -> Bool # (/=) :: Movie -> Movie -> Bool #
Show Movie Source #
Methods showsPrec :: Int -> Movie -> ShowS # show :: Movie -> String # showList :: [Movie] -> ShowS #

newtype Test Source #

Qualifying file (test set)

Constructors

Test
Fields testRating :: RatingDate

Instances

Eq Test Source #
Methods (==) :: Test -> Test -> Bool # (/=) :: Test -> Test -> Bool #
Show Test Source #
Methods showsPrec :: Int -> Test -> ShowS # show :: Test -> String # showList :: [Test] -> ShowS #

Additional types and helper functions

data TrainCol Source #

Constructors

TrainC
Fields tcMovieId :: MovieId tcTrainSet :: [Train]

Instances

Eq TrainCol Source #
Methods (==) :: TrainCol -> TrainCol -> Bool # (/=) :: TrainCol -> TrainCol -> Bool #
Show TrainCol Source #
Methods showsPrec :: Int -> TrainCol -> ShowS # show :: TrainCol -> String # showList :: [TrainCol] -> ShowS #

data RD a Source #

Constructors

RD
Fields rdRating :: a rdDate :: Day

Instances

Eq a => Eq (RD a) Source #
Methods (==) :: RD a -> RD a -> Bool # (/=) :: RD a -> RD a -> Bool #
Show a => Show (RD a) Source #
Methods showsPrec :: Int -> RD a -> ShowS # show :: RD a -> String # showList :: [RD a] -> ShowS #