Speech Recognition
Find speech corpora, audio-text pairs, and ASR evaluation resources.
ASR
Speech
<!doctype html>
Start from filtered catalog views, then open each original source to inspect licensing, download instructions, and dataset quality.
Use these entry points when you already know the kind of data you need.
Find speech corpora, audio-text pairs, and ASR evaluation resources.
Find voice data and references useful for training or evaluating Pashto TTS systems.
Find parallel corpora, text collections, dictionaries, and NLP datasets.
Find image, script, and document resources for Pashto OCR and text extraction.
Open the repository dataset index when you need the Markdown source used by maintainers.
Use the contribution notes and catalog rules before adding a new resource.