Text and Audio Captchas Text and Audio Captchas Dataset
The dataset contains 100k CAPTCHA samples, including:
- 50k text-based image captchas (
.png) - 50k audio-based CAPTCHAs (
.mp3) - Metadata CSV maps each CAPTCHA file to its correct label
Each CAPTCHA is labeled with its corresponding alphanumeric string, which makes it ideal for training OCR models, speech recognition, and AI-based CAPTCHA solvers.
Dataset structure
| Folder Name | Description |
|---|---|
Text/ | 50,000 text captcha images ( .png) |
Audio/ | 50,000 audio CAPTCHA files (.mp3) |
Metadata.csv | Map the CAPTCHA file to a CSV file of labels |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.