hadoop-formats-0.2.1.1: Read/write file formats commonly used by Hadoop.

Safe Haskell: None
Language: Haskell2010

Data.Hadoop.SequenceFile.Types

Documentation

data Header

The header of a sequence file. It contains the names of the Java classes used to encode the file and, optionally, some metadata.

Constructors

Header 

Fields

hdKeyType :: !Text

Package-qualified Java class name of the key type.

hdValueType :: !Text

Package-qualified Java class name of the value type.

hdCompressionType :: !Text

Package-qualified Java class name of the compression codec.

hdMetadata :: ![(Text, Text)]

File metadata, as a list of key/value pairs.

hdSync :: !MD5

The synchronization pattern used to check for corruption throughout the file.
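
As an illustration of how these fields might be used, the following is a minimal sketch rather than part of the library. It assumes local stand-in definitions that mirror the fields documented above (real code would import them from Data.Hadoop.SequenceFile.Types). The helpers lookupMeta and shortClassName and the value exampleHeader are hypothetical names introduced here.

```haskell
{-# LANGUAGE OverloadedStrings #-}

import qualified Data.ByteString as B
import           Data.Text (Text)
import qualified Data.Text as T

-- ASSUMPTION: local stand-ins that mirror the documented fields.
-- Real code would import these from Data.Hadoop.SequenceFile.Types.
newtype MD5 = MD5 { unMD5 :: B.ByteString }

data Header = Header
    { hdKeyType         :: !Text
    , hdValueType       :: !Text
    , hdCompressionType :: !Text
    , hdMetadata        :: ![(Text, Text)]
    , hdSync            :: !MD5
    }

-- Hypothetical helper: find a metadata entry by key, if present.
lookupMeta :: Text -> Header -> Maybe Text
lookupMeta key = lookup key . hdMetadata

-- Hypothetical helper: drop the Java package prefix from a class name.
shortClassName :: Text -> Text
shortClassName = T.takeWhileEnd (/= '.')

-- An illustrative header; the sync marker here is 16 zero bytes,
-- a placeholder rather than a real digest.
exampleHeader :: Header
exampleHeader = Header
    { hdKeyType         = "org.apache.hadoop.io.Text"
    , hdValueType       = "org.apache.hadoop.io.LongWritable"
    , hdCompressionType = "org.apache.hadoop.io.compress.DefaultCodec"
    , hdMetadata        = [("created-by", "example")]
    , hdSync            = MD5 (B.replicate 16 0)
    }
```

With these definitions, shortClassName (hdKeyType exampleHeader) yields "Text", and lookupMeta returns Nothing for keys that are absent from hdMetadata.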

newtype MD5

An MD5 hash. It is stored between record blocks in a sequence file to check for corruption.

Constructors

MD5 

Fields

unMD5 :: B.ByteString
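
Because the constructor wraps an arbitrary ByteString, callers may want to validate the length and render the hash for debugging. The following is a minimal sketch under that assumption. The newtype is redefined locally so the block is self-contained, and mkMD5 and toHex are hypothetical helpers, not functions exported by this module.

```haskell
import qualified Data.ByteString as B
import           Numeric (showHex)

-- ASSUMPTION: local stand-in mirroring the documented newtype.
newtype MD5 = MD5 { unMD5 :: B.ByteString }
    deriving (Eq, Ord)

-- Hypothetical smart constructor: an MD5 digest is always 16 bytes long,
-- so anything else is rejected.
mkMD5 :: B.ByteString -> Maybe MD5
mkMD5 bs
    | B.length bs == 16 = Just (MD5 bs)
    | otherwise         = Nothing

-- Hypothetical helper: render the hash as lowercase hexadecimal,
-- two characters per byte.
toHex :: MD5 -> String
toHex = concatMap byteHex . B.unpack . unMD5
  where
    byteHex w = case showHex w "" of
        [c] -> ['0', c]   -- pad single-digit bytes with a leading zero
        cs  -> cs
```

A reader that compares the marker found between blocks against hdSync from the header could use the derived Eq instance directly, with toHex reserved for error messages.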
 

data RecordBlock k v

A block of key/value pairs. The key at index i always corresponds to the value at index i, so both collections always have the same size.

Constructors

RecordBlock 

Fields

rbCount :: Int

The number of records.

rbKeys :: Collection k

The keys.

rbValues :: Collection v

The values.
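
The index-wise pairing described above can be sketched as follows. This is an illustration under stated assumptions, not the library's implementation: Collection is stood in for by a plain list (the library's actual Collection type may differ), and mkRecordBlock and records are hypothetical helpers.

```haskell
-- ASSUMPTION: a plain list stands in for the library's Collection type.
type Collection a = [a]

-- ASSUMPTION: local stand-in mirroring the documented fields.
data RecordBlock k v = RecordBlock
    { rbCount  :: Int
    , rbKeys   :: Collection k
    , rbValues :: Collection v
    }

-- Hypothetical smart constructor: enforce the invariant that keys and
-- values have the same number of elements, and record that count.
mkRecordBlock :: [k] -> [v] -> Maybe (RecordBlock k v)
mkRecordBlock ks vs
    | n == length vs = Just (RecordBlock n ks vs)
    | otherwise      = Nothing
  where
    n = length ks

-- Hypothetical helper: pair the key at index i with the value at index i.
records :: RecordBlock k v -> [(k, v)]
records rb = zip (rbKeys rb) (rbValues rb)
```

Keeping keys and values in separate collections matches the documented layout; records only zips them back together when a caller needs the pairs.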