Download zip Select Archive Format
Name Last Update history
File empty ..
File dir conf Loading commit data...
File dir local Loading commit data...
File txt README.txt Loading commit data...
File txt RESULTS Loading commit data...
File txt cmd.sh Loading commit data...
File txt path.sh Loading commit data...
File txt run.sh Loading commit data...
File txt steps Loading commit data...
File txt utils Loading commit data...

README.txt

Zeroth-Korean kaldi example is from Zeroth Project. Zeroth project introduces free Korean speech corpus and aims to make Korean speech recognition more broadly accessible to everyone. This project was developed in collaboration between Lucas Jo(@Atlas Guide Inc.) and Wonkyum Lee(@Gridspace Inc.). 

In this example, we are using 51.6 hours transcribed Korean audio for training data (22,263 utterances, 105 people, 3000 sentences) and 1.2 hours transcribed Korean audio for testing data (457 utterances, 10 people). Besides audio and transcription, we provide pre-trained/designed language model, lexicon and morpheme-based segmenter(morfessor)

The database can be also downloaded from openslr:
http://www.openslr.org/40

The database is licensed under Attribution 4.0 International (CC BY 4.0)

This folder contains a speech recognition recipe which is based on WSJ/Librispeech example.

For more details about Zeroth project, please visit:
https://github.com/goodatlas/zeroth