Branchless sinkhorn + native bfloat4 loads in fused collapse kernel

245f6cb
Merged

perf: optimize DeepSeek-V4 #13

This job was skipped