Cross Modal Retrieval On Recipe1M 1
Metrics
Image-to-text R@1
Text-to-image R@1
Results
Performance results of various models on this benchmark
| Paper Title | |||
|---|---|---|---|
| VLPCook | 45.2 | 47.3 | Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval |
| Marin et al. | 17 | 21 | Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images |
0 of 2 row(s) selected.