NVIDIA / Fuser Public

Notifications You must be signed in to change notification settings
Fork 60
Star 333

Code
Issues 253
Pull requests 162
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Pull requests: NVIDIA/Fuser

Labels 47 Milestones 0

New pull request New

162 Open 3,565 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Cupti timer

#4614 opened Jun 11, 2025 by Priya2698 • Draft

adding fallback for bmm

#4613 opened Jun 11, 2025 by jjsjann123 • Draft

Debug-dump communications

#4612 opened Jun 10, 2025 by wujingyue

Loading…

[WIP] Always do CGA split in persistent Hopper matmul

#4610 opened Jun 10, 2025 by jacobhinkle • Draft

[WIP] Inline stmatrix with TMA store

#4609 opened Jun 10, 2025 by jacobhinkle • Draft

CUB-based block-parallel topk implementation as a device func

#4607 opened Jun 10, 2025 by naoyam

Loading…

Retain sharding annotations in privatizeUpcast

#4606 opened Jun 10, 2025 by Priya2698

Loading…

auto select between warp specialized and multi-wave approaches

#4603 opened Jun 9, 2025 by liqiangxl • Draft

add static warp all reduce

#4599 opened Jun 8, 2025 by liqiangxl • Draft

Experimental argsort codegen support

#4598 opened Jun 8, 2025 by naoyam

Loading…

Move getOuterBroadcastTvs() to normalization_utils.h

#4594 opened Jun 8, 2025 by liqiangxl

Loading…

[DO NOT REVIEW] Adding index_shuffling

#4588 opened Jun 6, 2025 by jjsjann123 • Draft

Privatize squeeze in addition to upcast

#4583 opened Jun 5, 2025 by protonu

Loading…

LLVM lowering

#4581 opened Jun 5, 2025 by wolfcomos

Loading…

Generate ldstmatrix shared memory address using IdModel Matmuls

#4579 opened Jun 5, 2025 by rdspring1

Loading…

Create alternate loop domain for ldstmatrix shared memory indexing Matmuls

#4578 opened Jun 5, 2025 by rdspring1

Loading…

Enable ping-pong in inner outer normalization

#4577 opened Jun 5, 2025 by liqiangxl • Draft

Add option to insert resharding after

#4574 opened Jun 4, 2025 by samnordmann

Loading…

Add destructor to FusionProfiler to teardown CUPTI

#4565 opened Jun 3, 2025 by Priya2698

Loading…

Modify matmul scheduler to scheduler alternate_loop_domain for ldstmatrix Matmuls

#4551 opened May 30, 2025 by rdspring1

Loading…

Improve Hopper matmul heuristic to enable larger CGAs

#4547 opened May 30, 2025 by jacobhinkle

Loading…

Respect min and max of inputs to create more precise repro scripts

#4535 opened May 28, 2025 by crcrpar

Loading…

enable b200 pingpong test

#4530 opened May 28, 2025 by liqiangxl • Draft

Remove retry on OOM, error handling in timer

#4526 opened May 27, 2025 by Priya2698

Loading…

Recreate python_frontend test_basic for nvfuser_direct Direct Bindings

Python extension with direct mapping to NvFuser CPP objects.

Python API

Issues related to the Python API

#4521 opened May 26, 2025 by rdspring1 • Draft

Previous 1 2 3 4 5 6 7 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!