forked from NVIDIA/cutlass
-
Notifications
You must be signed in to change notification settings - Fork 68
Pull requests: intel/sycl-tla
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
MXFP4/MXFP8/int4 weights support in CuTe interface MoE GEMM example
#640
opened Nov 21, 2025 by
sanchitintel
Loading…
Added limitation check in gemm_with_epilogue_softmax example test
#636
opened Nov 19, 2025 by
kausikmaiti
•
Draft
Xe2 GEMMs with microscaling format weights as well as int4 weights with FP16/BF16 scales
#633
opened Nov 17, 2025 by
sanchitintel
Loading…
[Experiment] Evaluate perf impact of striped vs. blocked SLM read/write 1D copy atoms
#631
opened Nov 15, 2025 by
sanchitintel
•
Draft
1 task done
Changes for fix flash attention KV cache and prefill issues
#617
opened Nov 6, 2025 by
rishi-yadav
Loading…
[CUTLASS9-297] Downgrade driver to 25.35 and update action.yml
#596
opened Oct 31, 2025 by
aschabana
Loading…
Add CuTe Matrix Transpose tutorial
examples
Label for adding examples, complex kernels development using cutlass or cute APIS
information required
The PR requires more information to review them properly
Add python API for flash-attn
information required
The PR requires more information to review them properly
redesign required
Implementation require a redesign
wontfix
This will not be worked on
#558
opened Oct 13, 2025 by
YangKai0616
Loading…
Rewrite mma unit tests
Tests
For Unit tests and Benchmark tests and general validation specific changes
#557
opened Oct 13, 2025 by
yuanhang-dev
Loading…
Skip alignment check for sourceless epilogues
bug
Something isn't working
#555
opened Oct 13, 2025 by
nsingh-habana
•
Draft
[CI][WIP] Fix coverity workflow
Tests
For Unit tests and Benchmark tests and general validation specific changes
First version of SDPA Fwd - No need to review
redesign required
Implementation require a redesign
#548
opened Oct 6, 2025 by
cfgfung
Loading…
upload 2nd version of sdpa backward
redesign required
Implementation require a redesign
#546
opened Oct 3, 2025 by
yuankuns
Loading…
Support of FP8 Chunk Prefill kernel
redesign required
Implementation require a redesign
#542
opened Oct 1, 2025 by
adityachatter
Loading…
Support
nullptr value of argument ptr_C for xe_array_epilogue
#541
opened Sep 29, 2025 by
sanchitintel
Loading…
Attention sink support
redesign required
Implementation require a redesign
#533
opened Sep 25, 2025 by
kareemshaik80
Loading…
Add dimension check to prevent out-of-bounds access in example 05_bmg_gemm_with_epilogue_splitk
#529
opened Sep 23, 2025 by
ClarkChin08
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.