Visual Question Answering on MSRVTT-QA 2

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Model	Accuracy	Paper Title
FrozenBiLM	0.470	Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Just Ask	0.415	Just Ask: Learning to Answer Questions from Millions of Narrated Videos
SSML	0.35	Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
Aurora (ours, r=64)	-	-
