VCBench Mathematical Reasoning Benchmark Dataset
VCBench is a benchmark dataset for evaluating multimodal mathematical reasoning with explicit visual dependencies, released by Alibaba and Zhejiang University in 2025. The dataset contains 1,720 question-answer pairs and a total of 6,697 images.
The questions mainly include the following 6 areas:
- Time and Calendar: Tests temporal reasoning questions across two subcategories (Calendar and Clock), requiring an understanding of time intervals and calendar-based calculations.
- Space and Position: Challenges focus on spatial reasoning in three subcategories (direction, position, and place) to assess understanding of relative position, direction, and spatial relationships.
- Geometry and Shapes: Questions covering five subcategories (angles, quadrilaterals, rectangles, shapes, and triangles) test basic geometric understanding from basic shape recognition to more complex property analysis.
- Objects and Motion: Tasks in two subcategories (Cube and Move) that assess understanding of three-dimensional objects and motion transformations.
- Reasoning and Observation: Questions in both subcategories (Inference and Observation) are designed to test logical reasoning and careful visual observation skills.
- Organization and Patterns: Challenges across three subcategories (Organization, Pattern, and Weight), assessing pattern recognition, sequencing, and organizational logic.

Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.