Valentin Khrulkov; Leyla Mirvakhabova; Evgeniya Ustinova; Ivan Oseledets; Victor Lempitsky

Abstract
Computer vision tasks such as image classification, image retrieval and few-shot learning are currently dominated by Euclidean and spherical embeddings, so that the final decisions about class belongings or the degree of similarity are made using linear hyperplanes, Euclidean distances, or spherical geodesic distances (cosine similarity). In this work, we demonstrate that in many practical scenarios hyperbolic embeddings provide a better alternative.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| few-shot-image-classification-on-cub-200-5 | Hyperbolic ProtoNet | Accuracy: 72.22 |
| few-shot-image-classification-on-cub-200-5-1 | Hyperbolic ProtoNet | Accuracy: 60.52 |
| few-shot-image-classification-on-mini-2 | Hyperbolic ProtoNet | Accuracy: 51.57 |
| few-shot-image-classification-on-mini-3 | Hyperbolic ProtoNet | Accuracy: 66.27 |
| few-shot-image-classification-on-omniglot-1-1 | Hyperbolic ProtoNet | Accuracy: 95.9% |
| few-shot-image-classification-on-omniglot-1-2 | Hyperbolic ProtoNet | Accuracy: 99.0 |
| few-shot-image-classification-on-omniglot-5-1 | Hyperbolic ProtoNet | Accuracy: 98.15% |
| few-shot-image-classification-on-omniglot-5-2 | Hyperbolic ProtoNet | Accuracy: 99.4 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.