How 2R Video Retrieval Dataset
Date
Publish URL
Paper URL
License
Other

How 2R is a dataset for text-based video retrieval. The dataset contains 24,328 60-second clips and 51,390 related query terms collected from 9,371 videos in the HowTo 100M dataset, with an average of 2-3 related query terms per clip. 80% of the data is used for training, 10% of the data is used for verification, and 10% of the data is used for testing.
How 2R and How 2QA are new challenging benchmarks that can be used to study the fields of video retrieval and video question answering.
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.