Image Captioning On Conceptual Captions
Metrics
CIDEr
ROUGE-L
SPICE
Results
Performance results of various models on this benchmark
| Paper Title | ||||
|---|---|---|---|---|
| ClipCap (MLP + GPT2 tuning) | 87.26 | 26.71 | 18.5 | ClipCap: CLIP Prefix for Image Captioning |
| ClipCap (Transformer) | 71.82 | 25.12 | 16.07 | ClipCap: CLIP Prefix for Image Captioning |
0 of 2 row(s) selected.