🚀 Feature
We currently handle CPU / CUDA / autograd dispatch manually in our wrapper functions. We should instead use the dispatcher from PyTorch, which was built to do exactly that.
The work should closely follow the PR from @ezyang in #2366.
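For context, the hand-written dispatch in our wrappers looks roughly like the sketch below. This is illustrative only: `nms_cpu` / `nms_cuda` and the `WITH_CUDA` guard stand in for the actual per-backend kernels and build flags, and the autograd wiring is omitted.

```cpp
#include <ATen/ATen.h>

// Illustrative per-backend kernels (declarations only).
at::Tensor nms_cpu(const at::Tensor& dets, const at::Tensor& scores, double iou_threshold);
at::Tensor nms_cuda(const at::Tensor& dets, const at::Tensor& scores, double iou_threshold);

// Hand-written dispatch in the wrapper: inspect the device and pick a kernel.
at::Tensor nms(const at::Tensor& dets, const at::Tensor& scores, double iou_threshold) {
  if (dets.is_cuda()) {
#ifdef WITH_CUDA
    return nms_cuda(dets, scores, iou_threshold);
#else
    TORCH_CHECK(false, "nms is not compiled with GPU support");
#endif
  }
  return nms_cpu(dets, scores, iou_threshold);
}
```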
Motivation
The dispatcher is a new mechanism in PyTorch that selects which kernel to run depending on the properties of the input tensors that are passed. It is thus a centralized place where CPU / CUDA / autograd / autocast / quantized / XLA / etc. are handled.
One thing to keep an eye on: we currently need to duplicate the input checks for the CPU and CUDA kernels. This is something that @ezyang is working on in pytorch/pytorch#45277.
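With the dispatcher, the registration could look roughly like the following. This is a minimal sketch using `TORCH_LIBRARY` / `TORCH_LIBRARY_IMPL`; `nms_cpu`, `nms_cuda`, and `nms_autograd` are placeholder names for the existing per-backend implementations, not the actual torchvision code.

```cpp
#include <ATen/ATen.h>
#include <torch/library.h>

// Illustrative kernels; in practice these are the existing CPU / CUDA /
// autograd implementations.
at::Tensor nms_cpu(const at::Tensor& dets, const at::Tensor& scores, double iou_threshold);
at::Tensor nms_cuda(const at::Tensor& dets, const at::Tensor& scores, double iou_threshold);
at::Tensor nms_autograd(const at::Tensor& dets, const at::Tensor& scores, double iou_threshold);

// Define the operator schema once, in a single place.
TORCH_LIBRARY(torchvision, m) {
  m.def("nms(Tensor dets, Tensor scores, float iou_threshold) -> Tensor");
}

// Register one kernel per dispatch key; the dispatcher picks the right one
// based on the properties of the input tensors.
TORCH_LIBRARY_IMPL(torchvision, CPU, m) {
  m.impl("nms", nms_cpu);
}
TORCH_LIBRARY_IMPL(torchvision, CUDA, m) {
  m.impl("nms", nms_cuda);
}
TORCH_LIBRARY_IMPL(torchvision, Autograd, m) {
  m.impl("nms", nms_autograd);
}
```

The op is then reachable from Python as `torch.ops.torchvision.nms(dets, scores, iou_threshold)`, with a single entry point covering all backends.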
Current support:
- nms
- roi_align
- deform_conv2d
- roi_pool
- ps_roi_align
- ps_roi_pool
Question for @ezyang: following our discussion in https://github.com/pytorch/vision/pull/2366/files#r447547554, do you think we should provide a fallback in PyTorch for registering ops without double backwards?