Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

Overlap: Skip last iter on length
#5211 opened Jun 13, 2025 by IzzyPutterman
[fix] Fix Llama4 min-latency import error
#5209 opened Jun 13, 2025 by nv-yilinf
[feat] Add EAGLE3 support for Qwen3
#5206 opened Jun 13, 2025 by nv-yilinf
feat: Enable EPLB to existing MoE models
#5203 opened Jun 13, 2025 by syuoni
[fix][test] Speedup Nemotron NAS unittests
#5202 opened Jun 13, 2025 by omera-nv
Test
#5199 opened Jun 13, 2025 by ZhanruiSunCh Draft
Merge current waive list with the ToT waive list
#5198 opened Jun 13, 2025 by yiqingy0
tests: add ds r1 tp4 test
#5197 opened Jun 13, 2025 by xinhe-nv Draft
tests: add multi nodes tests
#5196 opened Jun 13, 2025 by xinhe-nv Draft
test: add deepseek rcca cases
#5195 opened Jun 13, 2025 by ruodil
refactor: dummy request creation
#5192 opened Jun 13, 2025 by lfr-0531
[chore] Linking fixes to NVRTC wrapper (label: Community want to contribute)
#5189 opened Jun 13, 2025 by AlessioNetti
test: add llama4 models for perf test
#5187 opened Jun 13, 2025 by ruodil
add dgx b200 8gpu test case in post merge
#5185 opened Jun 13, 2025 by yuanjingx87
feat: MoE trtllm backend kernel update
#5183 opened Jun 13, 2025 by rosenrodt
[doc] Update Perf-Overview.MD with V0.20 Release Data
#5176 opened Jun 13, 2025 by zbpatel