Pashto Datasets

This page collects Pashto datasets for speech and text tasks.

Start Here

Dataset Coverage

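  • Speech datasets for automatic speech recognition (ASR) and text-to-speech (TTS).
  • Text corpora for natural language processing (NLP) and machine translation (MT).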
  • Benchmark-ready subsets and metadata references.

Contribution

To add a dataset, follow dataset_guidelines.md and submit a PR with evidence, license, and task tags.
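
As a rough illustration of the pre-submission check implied above, the sketch below flags a dataset entry that lacks evidence, a license, or task tags. The field names (`evidence`, `license`, `task_tags`), the dict layout, and the `missing_fields` helper are assumptions made for this example; the actual requirements are whatever dataset_guidelines.md specifies.

```python
# Hypothetical required fields, inferred from the contribution sentence;
# the real schema is defined in dataset_guidelines.md and may differ.
REQUIRED_FIELDS = ("evidence", "license", "task_tags")


def missing_fields(entry: dict) -> list[str]:
    """Return the required fields that are absent or empty in an entry."""
    return [name for name in REQUIRED_FIELDS if not entry.get(name)]


# Placeholder entry, not a real dataset.
entry = {
    "name": "example-pashto-corpus",
    "evidence": "https://example.org/dataset-page",
    "license": "",          # empty on purpose, so the check flags it
    "task_tags": ["asr"],
}

print(missing_fields(entry))  # ['license']
```

An empty result would suggest the entry carries the three items the PR is expected to include; it says nothing about whether the evidence or license is actually valid.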