Yannick Estève / ONTRAC-Kaldi

Download zip

Name	Last Update	Last Commit 8dcb6dfcb61 – first commit	history
..
conf	Loading commit data...
local	Loading commit data...
README.txt	Loading commit data...
cmd.sh	Loading commit data...
diarization	Loading commit data...
path.sh	Loading commit data...
run.sh	Loading commit data...
sid	Loading commit data...
steps	Loading commit data...
utils	Loading commit data...

README.txt

This recipe is the speaker diarization recipe for The First DIHARD Speech
 Diarization Challenge (DIHARD 2018). There are two tracks in the DIHARD 2018 
 competition , one uses oracle SAD (track1) and the other required that SAD 
 was performed from scratch (track2). This script is for track1.

 The recipe is closely based on the following paper:
 http://www.danielpovey.com/files/2018_interspeech_dihard.pdf but doesn't
 contain the VB refinement. The whole system mainly contains full-covariance
 GMM-UBM, i-vector extractor (T-matrix), PLDA scoring and agglomerative 
 hierarchical clustering. The VoxCeleb datasets are used for training i-vectors 
 and PLDA. The development set of the DIHARD 2018 competition is used as 
 validation set to tune parameters. The system is tested on the DIHARD 2018 
 evaluation set.