VCR Visual Common Sense Reasoning Dataset
Date
Size
Publish URL
Paper URL
License
Other

VCR stands for Visual Commonsense Reasoning, which is a large-scale dataset for visual common sense reasoning. The dataset asks challenging questions about images, and the machine needs to complete two subtasks: answer the questions correctly and provide reasons to justify its answers.
The VCR dataset contains a large number of questions, 212K for training, 26K for validation, and 25K for testing. The answers and reasons come from more than 110K non-repeated movie scenes.
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.