Improved Baselines with Momentum Contrastive Learning

Xinlei Chen, Haoqi Fan, Ross Girshick, Kaiming He

Abstract
Contrastive unsupervised learning has recently shown encouraging progress, e.g., in Momentum Contrast (MoCo) and SimCLR. In this note, we verify the effectiveness of two of SimCLR's design improvements by implementing them in the MoCo framework. With simple modifications to MoCo---namely, using an MLP projection head and more data augmentation---we establish stronger baselines that outperform SimCLR and do not require large training batches. We hope this will make state-of-the-art unsupervised learning research more accessible. Code will be made public.
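For concreteness, below is a minimal PyTorch sketch of the two modifications the note describes: swapping the encoder's linear projection head for a 2-layer MLP, and adding SimCLR-style blur and color distortion to the augmentation pipeline. The 2048-d hidden layer and 128-d output follow common MoCo v2 implementations; the specific augmentation probabilities, jitter strengths, and blur kernel size shown here are typical choices, not values quoted from this abstract.

```python
import torch.nn as nn
import torchvision.models as models
import torchvision.transforms as transforms

# --- Modification 1: MLP projection head ---
# Replace the encoder's final fc layer with a 2-layer MLP
# (hidden dim 2048 with ReLU), keeping a 128-d output for the
# contrastive loss. Dimensions follow common MoCo v2 setups.
encoder = models.resnet50(num_classes=128)
dim_mlp = encoder.fc.weight.shape[1]  # 2048 for ResNet-50
encoder.fc = nn.Sequential(
    nn.Linear(dim_mlp, dim_mlp),
    nn.ReLU(),
    encoder.fc,
)

# --- Modification 2: stronger data augmentation ---
# SimCLR-style Gaussian blur and color distortion on top of the
# usual crop/flip/grayscale pipeline. torchvision's GaussianBlur
# is used here for self-containment; probabilities and the
# kernel size (~10% of the 224px crop) are typical, assumed values.
normalize = transforms.Normalize(mean=[0.485, 0.456, 0.406],
                                 std=[0.229, 0.224, 0.225])
augmentation = transforms.Compose([
    transforms.RandomResizedCrop(224, scale=(0.2, 1.0)),
    transforms.RandomApply(
        [transforms.ColorJitter(0.4, 0.4, 0.4, 0.1)], p=0.8),
    transforms.RandomGrayscale(p=0.2),
    transforms.RandomApply(
        [transforms.GaussianBlur(23, sigma=(0.1, 2.0))], p=0.5),
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
    normalize,
])
```

Both changes are local to the encoder and the data loader: MoCo's momentum encoder and negative-sample queue are left untouched, which is why large training batches remain unnecessary.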
Benchmarks
| Benchmark | Method | Metrics |
|---|---|---|
| Contrastive Learning on ImageNet-1k | ResNet-50 | Top-1 accuracy: 71.1% |
| Image Classification on Places205 | MoCo v2 | Top-1 accuracy: 52.9% |
| Person Re-Identification on SYSU-30k | MoCo v2 (self-supervised) | Rank-1: 11.6 |
| Self-Supervised Image Classification on ImageNet | MoCo v2 (ResNet-50) | Params: 24M; Top-1 accuracy: 71.1%; Top-5 accuracy: 90.1% |
| Self-Supervised Person Re-Identification on SYSU-30k | MoCo v2 | Rank-1: 11.6 |