-
Notifications
You must be signed in to change notification settings - Fork 458
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[PyTorch] Debug linear layer when saving original input and using debug quantizer
2.6.0
#1963
opened Jul 17, 2025 by
timmoon10
Loading…
1 of 13 tasks
[Test] Enable cuDNN Norm tests in the CPP suite
#1957
opened Jul 16, 2025 by
phu0ngng
Loading…
6 of 13 tasks
[PyTorch][Mcore] Fix illegal memory access issue while using Mcore async checkpoint with fp8 tensorwise recipe
bug
Something isn't working
#1956
opened Jul 16, 2025 by
zhongbozhu
Loading…
13 tasks
[PyTorch][FP8 CS] Remove the unnecessary torch reciprocal op in fp8 current scaling code path
performance
Performance issues
#1950
opened Jul 14, 2025 by
zhongbozhu
Loading…
13 tasks
[PyTorch] Support delay_wgrad_compute cudagraph
#1948
opened Jul 14, 2025 by
buptzyb
Loading…
2 of 13 tasks
[Minor] Update 1_getting_started.rst
documentation
Improvements or additions to documentation
#1947
opened Jul 13, 2025 by
dupeljan
Loading…
3 of 8 tasks
[JAX] Select cuDNN backend for normalization
2.6.0
#1946
opened Jul 11, 2025 by
phu0ngng
Loading…
13 tasks
[Common] Skip cuDNN 9.10.0/9.10.1 due to bugs
2.6.0
#1937
opened Jul 8, 2025 by
cyanguwa
Loading…
8 of 13 tasks
[BUILD] Exclude ninja from required packages
#1932
opened Jul 7, 2025 by
phu0ngng
Loading…
5 of 13 tasks
[PyTorch] Fuse permute+pad and unpermute+unpad ops for FP8 optimization
#1921
opened Jul 3, 2025 by
xiaoxi-wangfj
Loading…
3 of 12 tasks
Fix import error when flash attention 3 is installed
#1913
opened Jun 30, 2025 by
HollowMan6
Loading…
7 of 13 tasks
[PyTorch debug] Improve precision debug tools performance
#1909
opened Jun 30, 2025 by
pggPL
Loading…
9 of 13 tasks
[PyTorch] Support FA3 MLA CP feature
#1907
opened Jun 28, 2025 by
zhujian19891203
Loading…
7 of 13 tasks
[PyTorch Debug] Support log fp8 tensor stats for blockwise recipe
#1905
opened Jun 27, 2025 by
lengerfulluse
Loading…
12 tasks
[Common] NVFP4 kernels
enhancement
New feature or request
#1904
opened Jun 27, 2025 by
Oleg-Goncharov
•
Draft
5 of 13 tasks
[PyTorch Debug] More advanced stats for Quantized Tensors
#1897
opened Jun 26, 2025 by
pggPL
Loading…
2 of 13 tasks
[Pytorch] CP + THD + chunked attention support.
#1887
opened Jun 17, 2025 by
pggPL
Loading…
8 of 13 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.