VATEX Video Captioning Dataset
Date
Size
Publish URL
Paper URL
License: CC BY 4.0

VATEX (short for Video And TEXt) is a large-scale multilingual video description dataset containing 41,250 videos and 825,000 captions in English and Chinese. More than 206,000 of these captions form English-Chinese parallel translation pairs.
This dataset is mainly used for:
- Multilingual video caption generation
- Video caption translation
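
As a quick sanity check on the figures quoted in the description, the sketch below derives the average number of captions per video and the approximate share of captions that belong to a translation pair. The constant names are illustrative, and the even split of captions between English and Chinese is an assumption made here, not something the description states.

```python
# Figures taken from the dataset description.
NUM_VIDEOS = 41_250
NUM_CAPTIONS = 825_000
MIN_TRANSLATION_PAIRS = 206_000  # "more than 206,000", so this is a lower bound

# Average number of captions attached to each video (both languages combined).
captions_per_video = NUM_CAPTIONS / NUM_VIDEOS

# Assumption: captions are split evenly between English and Chinese.
captions_per_language = NUM_CAPTIONS // 2

# Lower bound on the share of each language's captions that have a parallel translation.
pair_share = MIN_TRANSLATION_PAIRS / captions_per_language

print(f"Captions per video: {captions_per_video:.0f}")
print(f"Captions per language (assumed even split): {captions_per_language:,}")
print(f"Share with a parallel translation: at least {pair_share:.1%}")
```

Under that assumption, roughly half of the captions in each language have a parallel counterpart, which is the subset relevant to the translation use case.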