aidatatang_200zh/
aishell/
aishell2/
ami/
an4/
apiai_decode/
aspire/
aurora4/
babel/
babel_multilang/
bentham/
bn_music_speech/
callhome_diarization/
callhome_egyptian/
chime1/
chime2/
chime3/
chime4/
chime5/
cifar/
commonvoice/
csj/
dihard_2018/
fame/
farsdat/
fisher_callhome_spanish/
fisher_english/
fisher_swbd/
formosa/
gale_arabic/
gale_mandarin/
gp/
heroico/
hkust/
hub4_english/
hub4_spanish/
iam/
iban/
ifnenit/
librispeech/
lre/
lre07/
madcat_ar/
madcat_zh/
material/
mgb5/
mini_librispeech/
multi_en/
ptb/
reverb/
rimes/
rm/
sitw/
spanish_dimex100/
sprakbanken/
sprakbanken_swe/
sre08/
sre10/
sre16/
svhn/
swahili/
swbd/
tedlium/
thchs30/
tidigits/
timit/
tunisian_msa/
uw3/
voxceleb/
voxforge/
vystadial_cz/
vystadial_en/
wsj/
yesno/
yomdle_fa/
yomdle_korean/
yomdle_russian/
yomdle_tamil/
yomdle_zh/
zeroth_korean/
README.txt

README.txt

This directory contains example scripts that demonstrate how to 
use Kaldi.  Each subdirectory corresponds to a corpus that we have
example scripts for.

Note: some of the scripts now use freely available data, including
voxforge, vystadial_{cz,en} and yesno.  Most of the other corpora are
distributed by the Linguistic Data Consortium (LDC), which charges a fee
for them unless you have a membership.

If you have an LDC membership, rm/s5 or wsj/s5 is probably the best place
to start trying out the scripts.
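
As a rough sketch of how to find out which corpora ship a given recipe
version (such as the s5 directories named above), the shell function below
walks the subdirectories and prints the matching corpus names.  The function
name list_recipes and the variable KALDI_ROOT are assumptions made for this
illustration; they are not part of Kaldi, and not every corpus directory
necessarily contains an s5 recipe.

```shell
# Hypothetical helper (not part of Kaldi): print the name of every corpus
# directory under $1 that contains a recipe directory named $2 (default: s5).
list_recipes() {
  egs_dir="$1"
  version="${2:-s5}"
  for d in "$egs_dir"/*/; do
    # Skip corpora that do not ship this recipe version.
    if [ -d "${d}${version}" ]; then
      basename "$d"
    fi
  done
}
```

For example, assuming KALDI_ROOT points at a Kaldi checkout, calling
list_recipes "$KALDI_ROOT/egs" s5 would print one corpus name per line.  For
a free corpus such as yesno, a recipe is then typically started by changing
into its recipe directory and running the top-level script there, e.g.
cd "$KALDI_ROOT/egs/yesno/s5" followed by ./run.sh.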