Skip to content

Actions: NVIDIA/TransformerEngine

Deploy nightly docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
526 workflow runs
526 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[PyTorch] Add tests for current scaling; misc related fixes (#1606)
Deploy nightly docs #872: Commit 3bcd7f6 pushed by ksivaman
March 27, 2025 14:04 1m 46s main
March 27, 2025 14:04 1m 46s
[PyTorch] Optimize MXFP8 all-gathers (#1581)
Deploy nightly docs #871: Commit 0356010 pushed by timmoon10
March 25, 2025 23:09 1m 25s main
March 25, 2025 23:09 1m 25s
[PyTorch] Minor fixes for TE 2.2 (#1589)
Deploy nightly docs #870: Commit 65c2798 pushed by cyanguwa
March 25, 2025 22:40 1m 36s main
March 25, 2025 22:40 1m 36s
Fix mxfp8 columnwise data missing (#1593)
Deploy nightly docs #869: Commit abbdd76 pushed by timmoon10
March 25, 2025 22:10 1m 30s main
March 25, 2025 22:10 1m 30s
[PyTorch] Defer torch compilation steps until first function call (#1…
Deploy nightly docs #868: Commit cf00d53 pushed by timmoon10
March 25, 2025 22:09 1m 28s main
March 25, 2025 22:09 1m 28s
[PyTorch] Fix issues for MCore DDP in grouped GEMM. (#1609)
Deploy nightly docs #867: Commit b59d1d8 pushed by ksivaman
March 25, 2025 18:32 1m 22s main
March 25, 2025 18:32 1m 22s
Remove deprecated interval arg to delayed scaling recipe (#1607)
Deploy nightly docs #866: Commit 945a559 pushed by ksivaman
March 25, 2025 16:06 1m 33s main
March 25, 2025 16:06 1m 33s
[JAX] Fixing importing in the encoder examples (#1600)
Deploy nightly docs #865: Commit 3dc8c6b pushed by phu0ngng
March 25, 2025 14:23 1m 23s main
March 25, 2025 14:23 1m 23s
Ensure weight transpose is valid for Hopper FP8 training (#1596)
Deploy nightly docs #864: Commit 1321b9b pushed by timmoon10
March 24, 2025 18:54 1m 30s main
March 24, 2025 18:54 1m 30s
Fix issues in fused_attn_bwd (#1574)
Deploy nightly docs #863: Commit e14d147 pushed by xrennvidia
March 24, 2025 17:01 1m 32s main
March 24, 2025 17:01 1m 32s
[PyTorch] Enable fp8_primary_weights for current scaling (#1544)
Deploy nightly docs #862: Commit 8681389 pushed by timmoon10
March 22, 2025 07:46 1m 17s main
March 22, 2025 07:46 1m 17s
[PyTorch] Use consistent API for fused norm kernels (#1560)
Deploy nightly docs #861: Commit e80fbd7 pushed by timmoon10
March 22, 2025 01:17 1m 20s main
March 22, 2025 01:17 1m 20s
Update cudnn-frontend to new 1.11.0-rc commit (#1590)
Deploy nightly docs #860: Commit dd4c17d pushed by cyanguwa
March 20, 2025 17:57 1m 25s main
March 20, 2025 17:57 1m 25s
Parallelize CPU reference implementation in tests 2 (#1588)
Deploy nightly docs #859: Commit 96f9c6d pushed by timmoon10
March 19, 2025 21:52 1m 35s main
March 19, 2025 21:52 1m 35s
Changed VERSION to 2.3.0.dev0
Deploy nightly docs #858: Commit eee710a pushed by ptrendx
March 18, 2025 23:30 1m 54s main
March 18, 2025 23:30 1m 54s
Fix return_bias option in LayerNormLinear and LayerNormMLP (#1569)
Deploy nightly docs #857: Commit 99f4067 pushed by ptrendx
March 18, 2025 23:27 1m 56s main
March 18, 2025 23:27 1m 56s
[JAX] Fix softmax aux shapes for packed/THD format (#1575)
Deploy nightly docs #856: Commit bee4649 pushed by mgoldfarb-nvidia
March 18, 2025 14:21 1m 42s main
March 18, 2025 14:21 1m 42s
Add KV cache for paged/non-paged attention (#1355)
Deploy nightly docs #855: Commit 4f33ece pushed by cyanguwa
March 18, 2025 05:35 1m 38s main
March 18, 2025 05:35 1m 38s
Update full recompute feature to save recipe. (#1577)
Deploy nightly docs #854: Commit 05f6a69 pushed by ksivaman
March 18, 2025 04:14 1m 19s main
March 18, 2025 04:14 1m 19s
[QA] Add error handling (#1570)
Deploy nightly docs #853: Commit c571c2f pushed by timmoon10
March 17, 2025 21:52 1m 25s main
March 17, 2025 21:52 1m 25s
Distopt with offload (#1573)
Deploy nightly docs #852: Commit 6a85596 pushed by ksivaman
March 17, 2025 20:32 1m 23s main
March 17, 2025 20:32 1m 23s
Better cuBLAS handle management (#1389)
Deploy nightly docs #851: Commit 7ddc593 pushed by ksivaman
March 17, 2025 18:48 1m 26s main
March 17, 2025 18:48 1m 26s
Add issue template (#1584)
Deploy nightly docs #850: Commit 4a74ef8 pushed by ksivaman
March 17, 2025 18:46 1m 40s main
March 17, 2025 18:46 1m 40s
[PyTorch] Support TP Overlap in Per-Tensor Current Scaling Recipe (#1…
Deploy nightly docs #849: Commit a7eeb28 pushed by timmoon10
March 15, 2025 00:39 1m 28s main
March 15, 2025 00:39 1m 28s
Refactoring attention.py part 1 (#1542)
Deploy nightly docs #848: Commit 3733947 pushed by KshitijLakhani
March 14, 2025 23:53 1m 17s main
March 14, 2025 23:53 1m 17s