Description
🚀 Feature
Provide a pretrained MobileNet-V2 backbone for
- Instance Segmentation
- Semantic Segmentation
Motivation
As ecosystems mature and more object detection / segmentation libraries are released, they always leverage Torchvision pretrained models, or use Torchvision as a template for which architectures are supported (at least when they start out).
Mmdetection, Mmsegmentation are two great examples of this.
Heavy ResNet like backbones aren't feasible to run on mobile devices, and it's rough to train a model on COCO from scratch because of the sheer resources it demands. Having a pretrained model would facilitate quicker experimentation and broader PyTorch impact overall.
By providing a MobileNet backbone, I think Torchvision would have a significant cascading impact on the extended PyTorch ecosystem.
Pitch
Train Faster Mask R-CNN and DeepLabV3 models with a MobileNet-V2 backbone using canonical PyTorch scripts. Perhaps it also makes sense to add R-CNN, Keypoint R-CNN and FCN to that list given the existing pretrained models
Alternatives
- Wait for Ross Wightman to add segmentation models to https://github.com/rwightman/efficientdet-pytorch
- TensorFlow and its extended ecosystem
cc @vfdev-5