futhark-0.21.8: An optimising compiler for a functional, array-oriented language.
Safe HaskellNone
LanguageHaskell2010

Futhark.CodeGen.ImpGen.GPU.SegScan.SinglePass

Description

Code generation for segmented and non-segmented scans. Uses a fast single-pass algorithm, but which only works on NVIDIA GPUs and with some constraints on the operator. We use this when we can.

Synopsis

Documentation

compileSegScan :: Pat LetDecMem -> SegLevel -> SegSpace -> SegBinOp GPUMem -> KernelBody GPUMem -> CallKernelGen () Source #

Compile SegScan instance to host-level code with calls to a single-pass kernel.