Visual Question Answering On Msvd Qa 2
Metrics
Accuracy
Results
Performance results of various models on this benchmark
| Paper Title | ||
|---|---|---|
| FrozenBiLM | 0.548 | Zero-Shot Video Question Answering via Frozen Bidirectional Language Models |
| Just Ask | 0.463 | Just Ask: Learning to Answer Questions from Millions of Narrated Videos |
0 of 2 row(s) selected.