CSS10 Speech Dataset

CSS10 is a collection of single-speaker speech datasets in ten languages. Each dataset consists of short audio clips taken from LibriVox audiobooks, together with their aligned transcripts. To verify the quality of the data, the authors also trained two neural text-to-speech models on it. The dataset can serve as a resource for future speech tasks.