Safe Haskell	None
Language	Haskell2010

Data.Hashabler

Contents

Hash Functions
- Hashing with the SipHash algorithm
- Hashing with the FNV-1a algorithm
  - FNV-1a Internal Parameters
Hashable types
- Creating your own Hashable instances
Stable "hashes" of Types
Implementing new hash functions
- Detailed discussion of principled Hashable instances

Synopsis

Documentation

The core of this library consists of

the Hashable class which defines how hashable chunks of bytes are delivered to the data-consuming portion of a hash function; new instances can be defined to support the hashing of new datatypes using an existing algorithm
the HashState class which implements the data-consuming portion of a particular hashing algorithm, consuming bytes delivered in hash; a new instance can be defined to implement a new hashing function that works on existing Hashable types.

We also include implementations for the following hash functions: hashFNV32, hashFNV64, siphash64, and siphash128.

Please see the project description for more information, including motivation.

Hash Functions

Hashes of different widths. We tag these hash types with the types of the data they were produced from so that e.g. we get a sensible Eq instance.

newtype Hash32 a Source #

Constructors

Hash32
Fields hashWord32 :: Word32

Instances

Eq (Hash32 k a) Source #
Methods (==) :: Hash32 k a -> Hash32 k a -> Bool # (/=) :: Hash32 k a -> Hash32 k a -> Bool #
Read (Hash32 k a) Source #
Methods readsPrec :: Int -> ReadS (Hash32 k a) # readList :: ReadS [Hash32 k a] # readPrec :: ReadPrec (Hash32 k a) # readListPrec :: ReadPrec [Hash32 k a] #
Show (Hash32 k a) Source #
Methods showsPrec :: Int -> Hash32 k a -> ShowS # show :: Hash32 k a -> String # showList :: [Hash32 k a] -> ShowS #

newtype Hash64 a Source #

Constructors

Hash64
Fields hashWord64 :: Word64

Instances

Eq (Hash64 k a) Source #
Methods (==) :: Hash64 k a -> Hash64 k a -> Bool # (/=) :: Hash64 k a -> Hash64 k a -> Bool #
Read (Hash64 k a) Source #
Methods readsPrec :: Int -> ReadS (Hash64 k a) # readList :: ReadS [Hash64 k a] # readPrec :: ReadPrec (Hash64 k a) # readListPrec :: ReadPrec [Hash64 k a] #
Show (Hash64 k a) Source #
Methods showsPrec :: Int -> Hash64 k a -> ShowS # show :: Hash64 k a -> String # showList :: [Hash64 k a] -> ShowS #

data Hash128 a Source #

Constructors

Hash128
Fields hashWord128_0 :: !Word64 hashWord128_1 :: !Word64

Instances

Eq (Hash128 k a) Source #
Methods (==) :: Hash128 k a -> Hash128 k a -> Bool # (/=) :: Hash128 k a -> Hash128 k a -> Bool #
Read (Hash128 k a) Source #
Methods readsPrec :: Int -> ReadS (Hash128 k a) # readList :: ReadS [Hash128 k a] # readPrec :: ReadPrec (Hash128 k a) # readListPrec :: ReadPrec [Hash128 k a] #
Show (Hash128 k a) Source #
Methods showsPrec :: Int -> Hash128 k a -> ShowS # show :: Hash128 k a -> String # showList :: [Hash128 k a] -> ShowS #

Hashing with the SipHash algorithm

SipHash is a fast hashing algorithm with very good mixing properties, designed to be very secure against hash-flooding DOS attacks. SipHash is a good choice whenever your application may be hashing untrusted user data.

data SipKey Source #

A 128-bit secret key. This should be generated randomly and must be kept secret.

Constructors

SipKey !Word64 !Word64

Instances

Eq SipKey Source #
Methods (==) :: SipKey -> SipKey -> Bool # (/=) :: SipKey -> SipKey -> Bool #
Read SipKey Source #
Methods readsPrec :: Int -> ReadS SipKey # readList :: ReadS [SipKey] # readPrec :: ReadPrec SipKey # readListPrec :: ReadPrec [SipKey] #
Show SipKey Source #
Methods showsPrec :: Int -> SipKey -> ShowS # show :: SipKey -> String # showList :: [SipKey] -> ShowS #

siphash64 :: Hashable a => SipKey -> a -> Hash64 a Source #

An implementation of 64-bit siphash-2-4.

This function is fast on 64-bit machines, and provides very good hashing properties and protection against hash flooding attacks.

This uses the "standard" recommended parameters of 2 and 4 rounds, recommended by the original paper, but siphash64_1_3 may be a faster and equally secure choice.

siphash64_1_3 :: Hashable a => SipKey -> a -> Hash64 a Source #

An implementation of 64-bit siphash-1-3.

This is somewhat faster than siphash-2-4 (implemented in siphash64), while the authors claim it should still offer good protection against known attacks. This is currently the standard hash function used in the Rust language.

siphash128 :: Hashable a => SipKey -> a -> Hash128 a Source #

An implementation of 128-bit siphash-2-4.

This function is fast on 64-bit machines, and provides very good hashing properties and protection against hash flooding attacks.

Hashing with the FNV-1a algorithm

The FNV-1a hash (see http://www.isthe.com/chongo/tech/comp/fnv/) is a fast and extremely simple hashing algorithm with fairly good mixing properties. Its simplicity makes it a good choice if you need to implement the same hashing routines on multiple platforms e.g. to verify a hash generated in JS on a web client with a hash stored on your server.

If you are hashing untrusted user data and are concerned with hash flooding attacks, consider SipHash instead; performance is about the same in the current implementation.

hashFNV32 :: Hashable a => a -> Hash32 a Source #

Hash a value using the standard spec-prescribed 32-bit seed value.

  hashFNV32 = Hash32 . fnv32 . hash fnvOffsetBasis32

hashFNV64 :: Hashable a => a -> Hash64 a Source #

Hash a value using the standard spec-prescribed 64-bit seed value. This may be slow on 32-bit machines.

  hashFNV64 = Hash64 . fnv64 . hash fnvOffsetBasis64

FNV-1a Internal Parameters

Magic FNV primes:

fnvPrime32 :: Word32 Source #

fnvPrime64 :: Word64 Source #

The arbitrary initial seed values for different output hash sizes. These values are part of the spec, but there is nothing special about them; supposedly, in terms of hash quality, any non-zero value seed should be fine passed to hash:

fnvOffsetBasis32 :: FNV32 Source #

fnvOffsetBasis64 :: FNV64 Source #

Hashable types

class Hashable a where Source #

A class of types that can be converted into a hash value. We expect all instances to display "good" hashing properties (wrt avalanche, bit independence, etc.) when passed to an ideal hash function.

We try to ensure that bytes are extracted from values in a way that is portable across architectures (where possible), and straightforward to replicate on other platforms and in other languages. Portable instances are also instances of StableHashable, and non-portable instances are NOTE-ed in instance docs here as well.

See the section "Defining Hashable instances" for details of what we expect from instances.

Minimal complete definition

hash

Methods

hash :: HashState h => h -> a -> h Source #

Add the bytes from the second argument into the hash, producing a new hash value. This is essentially a left fold of the methods of HashState over individual bytes extracted from a.

For some instances of HashState, this method might be a complete hashing algorithm, or might comprise the core of a hashing algorithm (perhaps with some final mixing), or might do something completely apart from hashing (e.g. simply cons bytes into a list for debugging).

Implementations must ensure that, for the same data:

Word16/32/64 arguments passed into the methods of HashState, and...
the choice of mix function itself...

...are consistent across architectures of different word size and endianness. For example do not define an instance which conditionally implements mix64 only on 64-bit architectures.

Instances

Hashable Bool Source #	hash h = hash h . \b-> if b then (1::Word8) else 0
Methods hash :: HashState h => h -> Bool -> h Source #
Hashable Char Source #	Hash a `Char` as big endian UTF-16. Note that Char permits values in the reserved unicode range U+D800 to U+DFFF; these Char values are added to the hash just as if they were valid 16-bit characters.
Methods hash :: HashState h => h -> Char -> h Source #
Hashable Double Source #	Hash a Double by way of `decodeFloat`. NOTE: this means 0 and -0 are considered equal, among other things. In general floating point values cannot be reasonably compared for absolute equality, so be careful if you make use of this instance.
Methods hash :: HashState h => h -> Double -> h Source #
Hashable Float Source #	Hash a Float by way of `decodeFloat`. NOTE: this means 0 and -0 are considered equal, among other things. In general floating point values cannot be reasonably compared for absolute equality, so be careful if you make use of this instance.
Methods hash :: HashState h => h -> Float -> h Source #
Hashable Int Source #	NOTE: `Int` has platform-dependent size. When hashing on 64-bit machines if the `Int` value to be hashed falls in the 32-bit Int range, we first cast it to an Int32. This should help ensure that programs that are correct across architectures will also produce the same hash values.
Methods hash :: HashState h => h -> Int -> h Source #
Hashable Int8 Source #
Methods hash :: HashState h => h -> Int8 -> h Source #
Hashable Int16 Source #
Methods hash :: HashState h => h -> Int16 -> h Source #
Hashable Int32 Source #
Methods hash :: HashState h => h -> Int32 -> h Source #
Hashable Int64 Source #
Methods hash :: HashState h => h -> Int64 -> h Source #
Hashable Integer Source #	Arbitrary-precision integers are hashed as follows: the magnitude is represented with 32-bit chunks (at least one, for zero; but no more than necessary), then bytes are added to the hash from most to least significant (including all initial padding 0s). Finally `mixConstructor` is called on the resulting hash value, with `(1::Word8)` if the `Integer` was negative, otherwise with `0`.
Methods hash :: HashState h => h -> Integer -> h Source #
Hashable Ordering Source #
Methods hash :: HashState h => h -> Ordering -> h Source #
Hashable Word Source #	NOTE: `Word` has platform-dependent size. When hashing on 64-bit machines if the `Word` value to be hashed falls in the 32-bit Word range, we first cast it to a Word32. This should help ensure that programs that are correct across architectures will also produce the same hash values.
Methods hash :: HashState h => h -> Word -> h Source #
Hashable Word8 Source #
Methods hash :: HashState h => h -> Word8 -> h Source #
Hashable Word16 Source #
Methods hash :: HashState h => h -> Word16 -> h Source #
Hashable Word32 Source #
Methods hash :: HashState h => h -> Word32 -> h Source #
Hashable Word64 Source #
Methods hash :: HashState h => h -> Word64 -> h Source #
Hashable TypeRep Source #	NOTE: no promise of consistency across platforms or GHC versions.
Methods hash :: HashState h => h -> TypeRep -> h Source #
Hashable () Source #	hash = const . mixConstructor 0
Methods hash :: HashState h => h -> () -> h Source #
Hashable BigNat Source #	The `BigNat`'s value is represented in 32-bit chunks (at least one, for zero; but no more than necessary), then bytes are added to the hash from most to least significant (including all initial padding 0s). Finally `mixConstructor 0` is called on the resulting hash value. Exposed only in GHC 7.10.
Methods hash :: HashState h => h -> BigNat -> h Source #
Hashable Natural Source #	The `Natural`'s value is represented in 32-bit chunks (at least one, for zero; but no more than necessary), then bytes are added to the hash from most to least significant (including all initial padding 0s). Finally `mixConstructor 0` is called on the resulting hash value. Exposed only in GHC 7.10
Methods hash :: HashState h => h -> Natural -> h Source #
Hashable Void Source #	hash _ _ = absurd Exposed only in GHC 7.10
Methods hash :: HashState h => h -> Void -> h Source #
Hashable Version Source #	The (now deprecated) `versionTags` field is ignored, and we follow the `Eq` instance which does not ignore trailing zeros.
Methods hash :: HashState h => h -> Version -> h Source #
Hashable Unique Source #	NOTE: No promise of stability across runs or platforms. Implemented via `hashUnique`.
Methods hash :: HashState h => h -> Unique -> h Source #
Hashable ThreadId Source #	NOTE: no promise of consistency across runs or platforms.
Methods hash :: HashState h => h -> ThreadId -> h Source #
Hashable ShortByteString Source #	Exposed only in bytestring >= v0.10.4
Methods hash :: HashState h => h -> ShortByteString -> h Source #
Hashable ByteString Source #	Lazy `ByteString`
Methods hash :: HashState h => h -> ByteString -> h Source #
Hashable ByteString Source #	Strict `ByteString`
Methods hash :: HashState h => h -> ByteString -> h Source #
Hashable ByteArray Source #	Here we hash each byte of the array in turn. If using this to hash some data stored internally as a `ByteArray#`, be aware that depending on the size and alignment requirements of that data, as well as the endianness of your machine, this might result in different hash values across different architectures.
Methods hash :: HashState h => h -> ByteArray -> h Source #
Hashable Text Source #	Lazy `Text`, hashed as big endian UTF-16.
Methods hash :: HashState h => h -> Text -> h Source #
Hashable Text Source #	Strict `Text`, hashed as big endian UTF-16.
Methods hash :: HashState h => h -> Text -> h Source #
Hashable a => Hashable [a] Source #
Methods hash :: HashState h => h -> [a] -> h Source #
Hashable a => Hashable (Maybe a) Source #
Methods hash :: HashState h => h -> Maybe a -> h Source #
(Integral a, Hashable a) => Hashable (Ratio a) Source #	hash s a = s `hash` numerator a `hash` denominator a
Methods hash :: HashState h => h -> Ratio a -> h Source #
Hashable (StableName a) Source #	NOTE: No promise of stability across runs or platforms. Implemented via `hashStableName`.
Methods hash :: HashState h => h -> StableName a -> h Source #
(Hashable a, Hashable b) => Hashable (Either a b) Source #
Methods hash :: HashState h => h -> Either a b -> h Source #
(Hashable a1, Hashable a2) => Hashable (a1, a2) Source #
Methods hash :: HashState h => h -> (a1, a2) -> h Source #
(Hashable a1, Hashable a2, Hashable a3) => Hashable (a1, a2, a3) Source #
Methods hash :: HashState h => h -> (a1, a2, a3) -> h Source #
(Hashable a1, Hashable a2, Hashable a3, Hashable a4) => Hashable (a1, a2, a3, a4) Source #
Methods hash :: HashState h => h -> (a1, a2, a3, a4) -> h Source #
(Hashable a1, Hashable a2, Hashable a3, Hashable a4, Hashable a5) => Hashable (a1, a2, a3, a4, a5) Source #
Methods hash :: HashState h => h -> (a1, a2, a3, a4, a5) -> h Source #
(Hashable a1, Hashable a2, Hashable a3, Hashable a4, Hashable a5, Hashable a6) => Hashable (a1, a2, a3, a4, a5, a6) Source #
Methods hash :: HashState h => h -> (a1, a2, a3, a4, a5, a6) -> h Source #
(Hashable a1, Hashable a2, Hashable a3, Hashable a4, Hashable a5, Hashable a6, Hashable a7) => Hashable (a1, a2, a3, a4, a5, a6, a7) Source #
Methods hash :: HashState h => h -> (a1, a2, a3, a4, a5, a6, a7) -> h Source #
(Hashable a1, Hashable a2, Hashable a3, Hashable a4, Hashable a5, Hashable a6, Hashable a7, Hashable a8) => Hashable (a1, a2, a3, a4, a5, a6, a7, a8) Source #
Methods hash :: HashState h => h -> (a1, a2, a3, a4, a5, a6, a7, a8) -> h Source #
(Hashable a, Hashable b, Hashable c, Hashable d, Hashable e, Hashable f, Hashable g, Hashable h, Hashable i) => Hashable (a, b, c, d, e, f, g, h, i) Source #
Methods hash :: HashState h => h -> (a, b, c, d, e, f, g, h, i) -> h Source #
(Hashable a, Hashable b, Hashable c, Hashable d, Hashable e, Hashable f, Hashable g, Hashable h, Hashable i, Hashable j) => Hashable (a, b, c, d, e, f, g, h, i, j) Source #
Methods hash :: HashState h => h -> (a, b, c, d, e, f, g, h, i, j) -> h Source #
(Hashable a, Hashable b, Hashable c, Hashable d, Hashable e, Hashable f, Hashable g, Hashable h, Hashable i, Hashable j, Hashable k) => Hashable (a, b, c, d, e, f, g, h, i, j, k) Source #
Methods hash :: HashState h => h -> (a, b, c, d, e, f, g, h, i, j, k) -> h Source #
(Hashable a, Hashable b, Hashable c, Hashable d, Hashable e, Hashable f, Hashable g, Hashable h, Hashable i, Hashable j, Hashable k, Hashable l) => Hashable (a, b, c, d, e, f, g, h, i, j, k, l) Source #
Methods hash :: HashState h => h -> (a, b, c, d, e, f, g, h, i, j, k, l) -> h Source #
(Hashable a, Hashable b, Hashable c, Hashable d, Hashable e, Hashable f, Hashable g, Hashable h, Hashable i, Hashable j, Hashable k, Hashable l, Hashable m) => Hashable (a, b, c, d, e, f, g, h, i, j, k, l, m) Source #
Methods hash :: HashState h => h -> (a, b, c, d, e, f, g, h, i, j, k, l, m) -> h Source #
(Hashable a, Hashable b, Hashable c, Hashable d, Hashable e, Hashable f, Hashable g, Hashable h, Hashable i, Hashable j, Hashable k, Hashable l, Hashable m, Hashable n) => Hashable (a, b, c, d, e, f, g, h, i, j, k, l, m, n) Source #
Methods hash :: HashState h => h -> (a, b, c, d, e, f, g, h, i, j, k, l, m, n) -> h Source #
(Hashable a, Hashable b, Hashable c, Hashable d, Hashable e, Hashable f, Hashable g, Hashable h, Hashable i, Hashable j, Hashable k, Hashable l, Hashable m, Hashable n, Hashable o) => Hashable (a, b, c, d, e, f, g, h, i, j, k, l, m, n, o) Source #
Methods hash :: HashState h => h -> (a, b, c, d, e, f, g, h, i, j, k, l, m, n, o) -> h Source #

Creating your own Hashable instances

When defining Hashable instances for your own algebraic data types you should do the following.

For types with a single constructor, simply call hash on each of the constructor's children, for instance:

instance (Hashable a, Hashable b, Hashable c) => Hashable (a, b, c) where
    hash h (a,b,c) = h `hash` a `hash` b `hash` c

And when a type has multiple constructors you should additionally call mixConstructor with a different argument for each constructor.

instance (Hashable a, Hashable b) => Hashable (Eithers a b) where
    hash h (Lefts a0 a1)     = mixConstructor 0 (h `hash` a0 `hash` a1)
    hash h (Rights b0 b1 b2) = mixConstructor 1 (h `hash` b0 `hash` b1 `hash` b2)

In the future we may offer a way to derive instances like this automatically.

mixConstructor Source #

Arguments

:: HashState h
=> Word8	Constructor number. We recommend starting from 0 and incrementing.
-> h	Hash state value to mix our byte into
-> h	New hash state

mixConstructor n h = h `mix8` (0xFF - n)

Stable "hashes" of Types

class Hashable a => StableHashable a where Source #

Types whose hashes can be compared across platforms. This is somewhat like a limited, but cross-platform Typeable.

Instances are expected to be universally-unique, and should be generated randomly. Type parameters can be hashed together using mixType, like:

instance (StableHashable b) => StableHashable (A b) where
    typeHash = mixType (TypeHash 530184177609460980)

When Hashable instances change, the TypeHash must be changed to a new random value. This lets us "version" a set of hashes; if we store a TypeHash along with a set of hashes in program A, in program B we can compare the stored value with our own TypeHash and verify that hashes we generate in program B can be meaningfully compared.

Note, obviously this doesn't ensure that values were hashed with the same hashing algorithm, and you should come up with your own means to serialize that information if you need to.

Minimal complete definition

typeHash

Methods

typeHash :: TypeHash a Source #

Instances

StableHashable Bool Source #
Methods typeHash :: TypeHash * Bool Source #
StableHashable Char Source #
Methods typeHash :: TypeHash * Char Source #
StableHashable Double Source #
Methods typeHash :: TypeHash * Double Source #
StableHashable Float Source #
Methods typeHash :: TypeHash * Float Source #
StableHashable Int Source #	The value here depends on whether we're on a 32 or 64-bit platform. See also the instance documentation for `Hashable`.
Methods typeHash :: TypeHash * Int Source #
StableHashable Int8 Source #
Methods typeHash :: TypeHash * Int8 Source #
StableHashable Int16 Source #
Methods typeHash :: TypeHash * Int16 Source #
StableHashable Int32 Source #
Methods typeHash :: TypeHash * Int32 Source #
StableHashable Int64 Source #
Methods typeHash :: TypeHash * Int64 Source #
StableHashable Integer Source #
Methods typeHash :: TypeHash * Integer Source #
StableHashable Ordering Source #
Methods typeHash :: TypeHash * Ordering Source #
StableHashable Word Source #	The value here depends on whether we're on a 32 or 64-bit platform. See also the instance documentation for `Hashable`.
Methods typeHash :: TypeHash * Word Source #
StableHashable Word8 Source #
Methods typeHash :: TypeHash * Word8 Source #
StableHashable Word16 Source #
Methods typeHash :: TypeHash * Word16 Source #
StableHashable Word32 Source #
Methods typeHash :: TypeHash * Word32 Source #
StableHashable Word64 Source #
Methods typeHash :: TypeHash * Word64 Source #
StableHashable () Source #
Methods typeHash :: TypeHash * () Source #
StableHashable BigNat Source #
Methods typeHash :: TypeHash * BigNat Source #
StableHashable Natural Source #
Methods typeHash :: TypeHash * Natural Source #
StableHashable Void Source #
Methods typeHash :: TypeHash * Void Source #
StableHashable Version Source #
Methods typeHash :: TypeHash * Version Source #
StableHashable ShortByteString Source #
Methods typeHash :: TypeHash * ShortByteString Source #
StableHashable ByteString Source #
Methods typeHash :: TypeHash * ByteString Source #
StableHashable ByteString Source #
Methods typeHash :: TypeHash * ByteString Source #
StableHashable ByteArray Source #
Methods typeHash :: TypeHash * ByteArray Source #
StableHashable Text Source #
Methods typeHash :: TypeHash * Text Source #
StableHashable Text Source #
Methods typeHash :: TypeHash * Text Source #
StableHashable a => StableHashable [a] Source #
Methods typeHash :: TypeHash * [a] Source #
StableHashable a => StableHashable (Maybe a) Source #
Methods typeHash :: TypeHash * (Maybe a) Source #
(Integral a, StableHashable a) => StableHashable (Ratio a) Source #
Methods typeHash :: TypeHash * (Ratio a) Source #
(StableHashable a, StableHashable b) => StableHashable (Either a b) Source #
Methods typeHash :: TypeHash * (Either a b) Source #
(StableHashable a, StableHashable b) => StableHashable (a, b) Source #
Methods typeHash :: TypeHash * (a, b) Source #
(StableHashable a, StableHashable b, StableHashable c) => StableHashable (a, b, c) Source #
Methods typeHash :: TypeHash * (a, b, c) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d) => StableHashable (a, b, c, d) Source #
Methods typeHash :: TypeHash * (a, b, c, d) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e) => StableHashable (a, b, c, d, e) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f) => StableHashable (a, b, c, d, e, f) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g) => StableHashable (a, b, c, d, e, f, g) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g, StableHashable h) => StableHashable (a, b, c, d, e, f, g, h) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g, h) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g, StableHashable h, StableHashable i) => StableHashable (a, b, c, d, e, f, g, h, i) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g, h, i) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g, StableHashable h, StableHashable i, StableHashable j) => StableHashable (a, b, c, d, e, f, g, h, i, j) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g, h, i, j) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g, StableHashable h, StableHashable i, StableHashable j, StableHashable k) => StableHashable (a, b, c, d, e, f, g, h, i, j, k) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g, h, i, j, k) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g, StableHashable h, StableHashable i, StableHashable j, StableHashable k, StableHashable l) => StableHashable (a, b, c, d, e, f, g, h, i, j, k, l) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g, h, i, j, k, l) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g, StableHashable h, StableHashable i, StableHashable j, StableHashable k, StableHashable l, StableHashable m) => StableHashable (a, b, c, d, e, f, g, h, i, j, k, l, m) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g, h, i, j, k, l, m) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g, StableHashable h, StableHashable i, StableHashable j, StableHashable k, StableHashable l, StableHashable m, StableHashable n) => StableHashable (a, b, c, d, e, f, g, h, i, j, k, l, m, n) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g, h, i, j, k, l, m, n) Source #
(StableHashable a, StableHashable b, StableHashable c, StableHashable d, StableHashable e, StableHashable f, StableHashable g, StableHashable h, StableHashable i, StableHashable j, StableHashable k, StableHashable l, StableHashable m, StableHashable n, StableHashable o) => StableHashable (a, b, c, d, e, f, g, h, i, j, k, l, m, n, o) Source #
Methods typeHash :: TypeHash * (a, b, c, d, e, f, g, h, i, j, k, l, m, n, o) Source #

newtype TypeHash a Source #

A value that uniquely identifies a StableHashable type. This serves to both version a type with respect to its Hashable instance, and distinguish types from each other (similar to TypeRep) across program runs, platforms and library versions.

Constructors

TypeHash
Fields typeHashWord :: Word64

Instances

Eq (TypeHash k a) Source #
Methods (==) :: TypeHash k a -> TypeHash k a -> Bool # (/=) :: TypeHash k a -> TypeHash k a -> Bool #
Read (TypeHash k a) Source #
Methods readsPrec :: Int -> ReadS (TypeHash k a) # readList :: ReadS [TypeHash k a] # readPrec :: ReadPrec (TypeHash k a) # readListPrec :: ReadPrec [TypeHash k a] #
Show (TypeHash k a) Source #
Methods showsPrec :: Int -> TypeHash k a -> ShowS # show :: TypeHash k a -> String # showList :: [TypeHash k a] -> ShowS #
Bits (TypeHash k a) Source #
Methods (.&.) :: TypeHash k a -> TypeHash k a -> TypeHash k a # (.\|.) :: TypeHash k a -> TypeHash k a -> TypeHash k a # xor :: TypeHash k a -> TypeHash k a -> TypeHash k a # complement :: TypeHash k a -> TypeHash k a # shift :: TypeHash k a -> Int -> TypeHash k a # rotate :: TypeHash k a -> Int -> TypeHash k a # zeroBits :: TypeHash k a # bit :: Int -> TypeHash k a # setBit :: TypeHash k a -> Int -> TypeHash k a # clearBit :: TypeHash k a -> Int -> TypeHash k a # complementBit :: TypeHash k a -> Int -> TypeHash k a # testBit :: TypeHash k a -> Int -> Bool # bitSizeMaybe :: TypeHash k a -> Maybe Int # bitSize :: TypeHash k a -> Int # isSigned :: TypeHash k a -> Bool # shiftL :: TypeHash k a -> Int -> TypeHash k a # unsafeShiftL :: TypeHash k a -> Int -> TypeHash k a # shiftR :: TypeHash k a -> Int -> TypeHash k a # unsafeShiftR :: TypeHash k a -> Int -> TypeHash k a # rotateL :: TypeHash k a -> Int -> TypeHash k a # rotateR :: TypeHash k a -> Int -> TypeHash k a # popCount :: TypeHash k a -> Int #

mixType :: forall a t. StableHashable a => TypeHash t -> TypeHash (t a) Source #

A helper for implementing typeHash by xor-ing type parameters with a new random hash value. E.g.:

 instance (StableHashable a, StableHashable b) => StableHashable (a, b) where
     typeHash = mixType (mixType (TypeHash 12071780118071628513))
                                           __ a new random value for (,)
                            ____ mix in the type hash for a
                    __ mix in the type hash for b

typeHashOf :: StableHashable a => a -> TypeHash a Source #

typeHashOfProxy :: StableHashable a => proxy a -> TypeHash a Source #

Implementing new hash functions

class HashState h where Source #

A class for defining how a hash function consumes input data. Bytes are fed to these methods in our Hashable instances, which promise to call these methods in a platform-independent way.

Instances of HashState only need to define mix8, but may additionally handle mix-ing in larger word chunks for performance reasons. For instance a hash function which operates on four bytes at a time might make use of mix32, and perhaps in mix8 pad with three additional 0s.

Endianness is normalized in Hashable instances, so these mix methods can expect to receive identical words across platforms.

Minimal complete definition

mix8

Methods

mix8 :: h -> Word8 -> h Source #

Mix in one byte.

mix16 :: h -> Word16 -> h Source #

Mix in a 2-byte word. Defaults to two mix8 on bytes from most to least significant.

mix32 :: h -> Word32 -> h Source #

Mix in a 4-byte word. Defaults to four mix8 on bytes from most to least significant.

mix64 :: h -> Word64 -> h Source #

Mix in a 8-byte word. Defaults to two mix32 on 32-byte words from most to least significant.

Detailed discussion of principled Hashable instances

This is a work-in-progress, and purely IYI.

Special care needs to be taken when defining instances of Hashable for your own types, especially for recursive types and types with multiple constructors. First instances need to ensure that distinct values produce distinct hash values. Here's an example of a bad implementation for Maybe:

instance (Hashable a)=> Hashable (Maybe a) where -- BAD!
    hash h (Just a) = h `hash` a                 -- BAD!
    hash h Nothing  = h `hash` (1::Word8)        -- BAD!

Here Just (1::Word8) hashes to the same value as Nothing.

Second and more tricky, instances should not permit a function f :: a -> (a,a) such that x hash y == x hash y1 hash y2 where (y1,y2) = f y... or something. The idea is we want to avoid the following kinds of collisions:

hash [Just 1, Nothing] == hash [Just 1]          -- BAD!
hash ([1,2], [3])      == hash ([1], [2,3])      -- BAD!

Maybe what we mean is that where a is a Monoid, we expect replacing mappend with the hash operation to always yield different values. This needs clarifying; please help.

Here are a few rules of thumb which should result in principled instances for your own types (This is a work-in-progress; please help):

If all values of a type have a static structure, i.e. the arrangement and number of child parts to be hashed is knowable from the type, then one may simply hash each child element of the type in turn. This is the case for product types like tuples (where the arity is reflected in the type), or primitive numeric values composed of a static number of bits.

Otherwise if the type has variable structure, e.g. if it has multiple constructors or is an array type...

Every possible value of a type should inject at least one byte of entropy apart from any recursive calls to child elements; we can ensure this is the case by hashing an initial or final distinct byte for each distinct constructor of our type

To ensure hashing remains consistent across platforms, instances should not compile-time-conditionally call different mix-family HashState functions. This rule doesn't matter for instances like FNV32 which mix in data one byte at a time, but other HashState instances may operate on multiple bytes at a time, perhaps using padding bytes, so this becomes important.

A final important note: we're not concerned with collisions between values of different types; in fact in many cases "equivalent" values of different types intentionally hash to the same value. This also means instances cannot rely on the hashing of child elements being uncorrelated. That might be one interpretation of the mistake in our faulty Maybe instance above