Skip to content

fix: replace broadcast multiply with matmul in sparse pooled attention

41ba612
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Merged

fix(ds4): memory spike in sparse pooled attention at 4k+ context #17

fix: replace broadcast multiply with matmul in sparse pooled attention
41ba612
Select commit
Loading
Failed to load commit list.
Job log options

This job was skipped