Pull requests: NVIDIA/TransformerEngine

Pull requests list

Fix the sm120 compilation with CUDA 12
#2482 opened Dec 5, 2025 by ptrendx
1 of 13 tasks
[PyTorch] Add THD support for max_logit/MuonClip (label: 2.11.0)
#2480 opened Dec 4, 2025 by cyanguwa
8 of 13 tasks
Add support for SWA (left, right) with FusedAttention
#2477 opened Dec 4, 2025 by sudhakarsingh27
22 of 28 tasks
fix ce loss calculation when some tokens are ignored (label: bug)
#2476 opened Dec 4, 2025 by yashaswikarnati
1 of 13 tasks
[JAX] Einsum with quantization
#2474 opened Dec 3, 2025 by phu0ngng Draft
13 tasks
[Draft] Jax primitives for permutation on single GPU
#2473 opened Dec 3, 2025 by tdophung
13 tasks
[PyTorch] Documentation for op fuser API (label: documentation)
#2447 opened Dec 3, 2025 by timmoon10
8 of 13 tasks
Add ccache support to TE and use it in GitHub actions
#2444 opened Dec 2, 2025 by ptrendx Draft
1 of 6 tasks
[PyTorch] Enable post-RHT amax estimation
#2442 opened Dec 2, 2025 by negvet
1 of 13 tasks
[pyTorch] CPU performance optimizations
#2439 opened Dec 1, 2025 by ptrendx Draft
13 tasks
support cuda graph capture offloading module
#2435 opened Dec 1, 2025 by lhb8125 Draft
13 tasks
[PyTorch] Add FA4 Support
#2432 opened Nov 28, 2025 by yaox12 Draft
1 of 16 tasks
[Pytorch][Bug]MXFP8 Split tensor Bug fix
#2427 opened Nov 26, 2025 by vthumbe1503
2 of 13 tasks
[PyTorch] Convert sample tuple to list in cudagraph input reuse
#2426 opened Nov 26, 2025 by buptzyb
13 tasks
Fix FusedAdam DTensor compatibility issue
#2425 opened Nov 26, 2025 by shjwudp
13 tasks
[JAX] Wrapper for Permutation Triton kernel MoE
#2419 opened Nov 25, 2025 by tdophung Draft
9 of 16 tasks
[Common] Add kFloat64 partial support
#2417 opened Nov 24, 2025 by phu0ngng
7 of 13 tasks
[Common] Persistent NVFP4 cast + transpose kernel (labels: 2.11.0, enhancement, performance)
#2412 opened Nov 21, 2025 by Oleg-Goncharov
6 of 13 tasks
Enables specified cp rank slicing
#2387 opened Nov 14, 2025 by jomitchellnv
1 of 13 tasks