Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[PyTorch] Improve L2Normalization basic op
#1964 opened Jul 18, 2025 by negvet Draft
6 of 13 tasks
[PyTorch] Remove GH pinned deps 2.6.0
#1961 opened Jul 17, 2025 by ksivaman Loading…
8 of 13 tasks
[Test] Enable cuDNN Norm tests in the CPP suite
#1957 opened Jul 16, 2025 by phu0ngng Loading…
6 of 13 tasks
[PyTorch Debug] Debug support for GroupedLinear
#1953 opened Jul 15, 2025 by pggPL Draft
13 tasks
[PyTorch] Refactor C++ quantizer infrastructure
#1952 opened Jul 15, 2025 by timmoon10 Draft
2 of 13 tasks
Refactor te.ops
#1951 opened Jul 15, 2025 by janekb04 Loading…
6 of 13 tasks
[PyTorch] Support delay_wgrad_compute cudagraph
#1948 opened Jul 14, 2025 by buptzyb Loading…
2 of 13 tasks
[Minor] Update 1_getting_started.rst documentation Improvements or additions to documentation
#1947 opened Jul 13, 2025 by dupeljan Loading…
3 of 8 tasks
[JAX] Select cuDNN backend for normalization 2.6.0
#1946 opened Jul 11, 2025 by phu0ngng Loading…
13 tasks
[Common] Skip cuDNN 9.10.0/9.10.1 due to bugs 2.6.0
#1937 opened Jul 8, 2025 by cyanguwa Loading…
8 of 13 tasks
[BUILD] Exclude ninja from required packages
#1932 opened Jul 7, 2025 by phu0ngng Loading…
5 of 13 tasks
Fix import error when flash attention 3 is installed
#1913 opened Jun 30, 2025 by HollowMan6 Loading…
7 of 13 tasks
[PyTorch debug] Improve precision debug tools performance
#1909 opened Jun 30, 2025 by pggPL Loading…
9 of 13 tasks
[PyTorch] Support FA3 MLA CP feature
#1907 opened Jun 28, 2025 by zhujian19891203 Loading…
7 of 13 tasks
[PyTorch Debug] Support log fp8 tensor stats for blockwise recipe
#1905 opened Jun 27, 2025 by lengerfulluse Loading…
12 tasks
[Common] NVFP4 kernels enhancement New feature or request
#1904 opened Jun 27, 2025 by Oleg-Goncharov Draft
5 of 13 tasks
Fix fp8_calibration path
#1903 opened Jun 27, 2025 by sudhakarsingh27 Draft
1 of 13 tasks
[PyTorch Debug] More advanced stats for Quantized Tensors
#1897 opened Jun 26, 2025 by pggPL Loading…
2 of 13 tasks
[Pytorch] CP + THD + chunked attention support.
#1887 opened Jun 17, 2025 by pggPL Loading…
8 of 13 tasks
pipeline aware cpu offload
#1886 opened Jun 17, 2025 by liuzhenhai93 Loading…
8 tasks done
ProTip! Add no:assignee to see everything that’s not assigned.