CAS-VSR-W1k Lip Reading Recognition Dataset

Date

3 years ago

Organization

Publish URL

vipl.ict.ac.cn

Paper URL

arxiv.org

License

Non-Commercial


CAS-VSR-W1k, formerly known as LRW-1000, is the largest publicly available Mandarin word-level lip reading dataset. It contains 1,000 word classes and about 700,000 samples from more than 2,000 speakers, for a total of more than 1,000,000 Chinese character instances.
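To get a feel for the scale, the rounded figures quoted above can be turned into rough averages. The sketch below is back-of-the-envelope arithmetic only: the constants are the rounded numbers from the description, not exact counts, and the function name `rough_averages` is made up for illustration. Because the speaker and character counts are stated as "more than", the derived per-speaker value is an upper bound and the characters-per-sample value is a lower bound.

```python
# Rounded figures taken from the dataset description (not exact counts).
NUM_CLASSES = 1_000              # word classes
NUM_SAMPLES = 700_000            # samples, rounded
MIN_SPEAKERS = 2_000             # "more than 2,000", so a lower bound
MIN_CHAR_INSTANCES = 1_000_000   # "more than 1,000,000", so a lower bound


def rough_averages():
    """Return rough per-class, per-speaker and per-sample averages.

    Hypothetical helper for illustration; the values are only as precise
    as the rounded inputs above.
    """
    return {
        # Mean number of samples per word class.
        "samples_per_class": NUM_SAMPLES / NUM_CLASSES,
        # Upper bound: more speakers means fewer samples per speaker.
        "max_samples_per_speaker": NUM_SAMPLES / MIN_SPEAKERS,
        # Lower bound: more character instances means more per sample.
        "min_chars_per_sample": MIN_CHAR_INSTANCES / NUM_SAMPLES,
    }


if __name__ == "__main__":
    for name, value in rough_averages().items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions, a class has on the order of 700 samples on average, and a sample spans a little over 1.4 characters on average, which fits word classes made up of one or a few characters. These are averages only and say nothing about how evenly samples are spread across classes or speakers.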

Each class corresponds to the syllables of a Mandarin word made up of one or more Chinese characters. The dataset is designed to cover the natural variation in speech modes and imaging conditions, so that it reflects the challenges encountered in real-world applications.
