Visual Commonsense Reasoning On Vcr Qa R Dev
Metrics
Accuracy
Results
Performance results of various models on this benchmark
0 of 1 row(s) selected.
Performance results of various models on this benchmark
Performance results of various models on this benchmark