heph-aligned-storable: Generically derive Storable instances suitable for CPU-GPU transfer

[ bsd3, graphics, library ] [ Propose Tags ] [ Report a vulnerability ]

Please see the README on GitHub at https://github.com/jtnuttall/heph/tree/main/heph-aligned-storable#readme

[Skip to Readme]

Modules

[Index] [Quick Jump]

Foreign
- GPU
  - Marshal
    - Foreign.GPU.Marshal.Aligned
  - Storable
    - Foreign.GPU.Storable.Aligned

Downloads

heph-aligned-storable-0.1.0.0.tar.gz [browse] (Cabal source package)
Package description (as included in the package)

Maintainer's Corner

Package maintainers

jtnuttall

For package maintainers and hackage trustees

edit package information

Candidates

0.1.0.0

Versions [RSS]	0.1.0.0
Change log	CHANGELOG.md
Dependencies	base (>=4.7 && <5), half (>=0.3 && <0.5), hashable (>=1.3 && <1.6), linear (>=1.20 && <1.24), unliftio (>=0.2 && <0.3), vector (>=0.12.3.0 && <0.14), vector-sized (>=1.4 && <1.7) [details]
Tested with	ghc ==9.8.4 \|\| ==9.6.5 \|\| ==8.10.7 \|\| ==8.10.4
License	BSD-3-Clause
Copyright	2025 Jeremy Nuttall
Author	Jeremy Nuttall
Maintainer	jeremy@jeremy-nuttall.com
Uploaded	by jtnuttall at 2026-02-02T07:06:02Z
Category	Graphics
Home page	https://github.com/jtnuttall/heph/tree/main/heph-aligned-storable#readme
Bug tracker	https://github.com/jtnuttall/heph/issues
Source repo	head: git clone https://github.com/jtnuttall/heph
Distributions
Downloads	3 total (3 in the last 30 days)
Rating	2.0 (votes: 1) [estimated by Bayesian average]
Your Rating	λ λ λ
Status	Docs uploaded by user Build status unknown [no reports yet]

Readme for heph-aligned-storable-0.1.0.0

[back to package description]

heph-aligned-storable

Generically derive Storable instances for GPU memory layouts (std140, std430, scalar).

Quick Start

IMPORTANT: Be sure to use layout(row_major) if you are using linear with this library.

GLSL:

layout(std140, row_major, binding = 0) uniform myuniforms {
  mat4 modelViewProjection;
  vec3 cameraPosition;
  float time;
};

Haskell:

{-# LANGUAGE DeriveGeneric #-}
{-# LANGUAGE DataKinds #-}
{-# LANGUAGE TypeApplications #-}

import Foreign.GPU.Storable.Aligned
import Foreign.GPU.Marshal.Aligned
import GHC.Generics (Generic)
import Linear (M44, V3, V4(..))

data Uniforms = Uniforms
  { modelViewProjection :: M44 Float
  , cameraPosition      :: V3 Float
  , time                :: Float
  } deriving (Generic, Show, Eq)

instance AlignedStorable Std140 Uniforms

main :: IO ()
main = do
  let uniforms = Uniforms
        { modelViewProjection = V4 (V4 1 0 0 0) (V4 0 1 0 0) (V4 0 0 1 0) (V4 0 0 0 1)
        , cameraPosition = V3 0 0 5
        , time = 0
        }
  withPacked @Std140 uniforms $ \ptr -> do
    -- ptr is ready for vkCmdPushConstants, memcpy to mapped buffer, etc.
    pure ()

Features

Correct, spec-compliant padding for Std140, Std430, and Scalar layouts
Single memcpy for arrays via AlignedArray
Type-level layout witnesses prevent mismatched layouts at compile time
Zero runtime overhead—generic machinery fully eliminated by GHC

The Contract

alignedPoke writes member data only. Padding bytes are untouched.

Use the helpers in Foreign.GPU.Marshal.Aligned (withPacked, allocaPacked, etc.) for guaranteed zero-initialized padding. If you allocate memory yourself, use calloc or zero the buffer before poking.

Arrays

By default, arrays are poked element-by-element. For a single memcpy, wrap in AlignedArray:

data MyStruct (layout :: MemoryLayout) = MyStruct
  { meta   :: Float
  , pixels :: AlignedArray layout 64 (V4 Float)  -- memcpy'd as a block
  } deriving Generic

instance AlignedStorable Std140 (MyStruct Std140)

Gotchas

Matrix naming conventions

linear uses Mnm for n rows × m columns. GLSL uses matNxM for N columns of M-vectors.

M32 Float (3 rows, 2 cols) → mat2x3
M24 Double (2 rows, 4 cols) → dmat4x2

`row_major`

GLSL's layout(row_major) affects memory layout, not matrix semantics. Matrices are still column-major for arithmetic. This library implements the memory layout correctly. You don't need to transpose before upload.

`vec3` and `mat3` are cursed

Driver handling of the round-up rules for these types has historically been inconsistent. Consider padding to vec4/mat4 and pretending the 3-element variants don't exist.

Why not `derive-storable`?

derive-storable produces FFI-compatible layouts (C struct ABI), not GPU layouts. GPU alignment rules differ:

std140 rounds struct alignment to 16 bytes
scalar layout requires 4-byte booleans, not 1-byte