Visual Madlibs Image Description Dataset
Date
Publish URL
Paper URL
License
Other

Visual Madlibs contains 360,001 natural language descriptions for 10,738 images. The dataset uses automatically generated fill-in-the-blank templates to collect descriptions of several targets, including: people and objects, appearance, activities and interactions, and inferences about general scenes or broader contexts.
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.