LSVTD Video Text Understanding Dataset

LSVTD (Large-Scale Video Text Dataset) contains 100 videos captured in 22 natural scenes: 13 indoor scenes (such as bookstores and shopping malls) and 9 outdoor scenes. Its scene diversity is more than 3 times that of the IC15 dataset.