Unsupervised Discovery of the Long-Tail in Instance Segmentation Using
Hierarchical Self-Supervision
Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision
Zhenzhen Weng Mehmet Giray Ogut Shai Limonchik Serena Yeung

Abstract
Instance segmentation is an active topic in computer vision that is usuallysolved by using supervised learning approaches over very large datasetscomposed of object level masks. Obtaining such a dataset for any new domain canbe very expensive and time-consuming. In addition, models trained on certainannotated categories do not generalize well to unseen objects. The goal of thispaper is to propose a method that can perform unsupervised discovery oflong-tail categories in instance segmentation, through learning instanceembeddings of masked regions. Leveraging rich relationship and hierarchicalstructure between objects in the images, we propose self-supervised losses forlearning mask embeddings. Trained on COCO dataset without additionalannotations of the long-tail objects, our model is able to discover novel andmore fine-grained objects than the common categories in COCO. We show that themodel achieves competitive quantitative results on LVIS as compared to thesupervised and partially supervised methods.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| novel-object-detection-on-lvis-v1-0-val | Weng et al. Weng et al. (2021)* | All mAP: 1.62 Known mAP: 17.85 Novel mAP: 0.27 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.