Workgroup2 Reductions and Scans
Pre-release
Pre-release
·
1483 commits
to master
since this release
What's Changed
Workgroup Scans
nbl::hlsl::workgroup2 reduce + scan by @keptsecret in #876
Highly Performant, the subgroup emulated variant (Stone-Kogge adder made of subgroupShuffleUp) up to 200% faster than native (subgroupInclusiveAdd) on Nvidia RTX GPUs.
Blogpost incoming.
Full Changelog: v0.6.1-alpha1...v0.6.2-alpha1