Skip to content

Pull requests: ROCm/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] Speedup amax triton
#385 opened Nov 26, 2025 by matthiasdiener Draft
13 tasks
Upcoming ROCm and JAX 0.8 support
#383 opened Nov 26, 2025 by ipanfilo Loading…
2 of 13 tasks
Enable AOTriton BWD V3 API
#382 opened Nov 25, 2025 by Micky774 Loading…
13 tasks
Old FP8 support code cleanup
#379 opened Nov 24, 2025 by ipanfilo Loading…
1 of 13 tasks
Re-enable supported GEMM configs
#378 opened Nov 24, 2025 by ipanfilo Loading…
13 tasks
Layernorm forward optimization
#377 opened Nov 24, 2025 by eliotwang Loading…
13 tasks
IFU dev v2.6
#374 opened Nov 19, 2025 by wangye805 Loading…
9 of 13 tasks
PyTorch FA test fix
#370 opened Nov 12, 2025 by Micky774 Loading…
13 tasks
Current scaling: two-stage amax kernel
#369 opened Nov 12, 2025 by matthiasdiener Loading…
3 of 15 tasks
Userbuffer epic
#367 opened Nov 11, 2025 by alextmagro Draft
JAX FA Benchmarking Script
#351 opened Oct 24, 2025 by Micky774 Loading…
13 tasks
[NO MERGE] Release v2.4 rocm
#334 opened Oct 8, 2025 by alextmagro Loading…
Triton norms dispatch refactor
#305 opened Sep 5, 2025 by Micky774 Loading…
13 tasks
heyi's layernorm optimization
#225 opened Jul 3, 2025 by eliotwang Loading…
8 of 13 tasks
Added Dockerfile for CI images
#195 opened May 28, 2025 by VeeraRajasekhar Loading…
7 of 13 tasks
[ROCm] support triton-based flash-attn in TE
#177 opened May 1, 2025 by wangye805 Loading…
8 of 13 tasks
Update attention example attention.ipynb
#152 opened Mar 19, 2025 by anhminhnguyenhoang Loading…
5 of 13 tasks
Honor the NVTE_FUSED_ATTN_<backend> in test_fused_attn.py
#123 opened Feb 11, 2025 by wangye805 Loading…
13 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.