Missing a dataset? Open an issue to add yours.

Dataset Inventory

A citation-validated browser for language datasets visible in the research literature.

Click any row to see the full record: where the dataset comes from, how it is used, and how to access it.

records
languages covered
with dataset access
documented but not released
Language | Dataset | Year | Modality | Source Paper