Kaldi GALE Recipe
Abstract
This package includes files for building an Arabic ASR system using the GALE database from LDC and the Kaldi Speech Recognition Toolkit. The test set is a mix of conversational and report speech.
Related publications
- A. Ali, Y. Zhang, P. Cardinal, N. Dahak, S. Vogel, and J. Glass, “A complete KALDI recipe for building Arabic speech recognition systems,” in Spoken Language Technology Workshop (SLT), 2014 IEEE, 2014, pp. 525-529. doi:10.1109/SLT.2014.7078629
@INPROCEEDINGS{aaliArabicKaldi,
  author={Ali, A. and Yifan Zhang and Cardinal, P. and Dahak, N. and Vogel, S. and Glass, J.},
  booktitle={Spoken Language Technology Workshop (SLT), 2014 IEEE},
  title={A complete KALDI recipe for building Arabic speech recognition systems},
  year={2014},
  month={Dec},
  pages={525-529},
  doi={10.1109/SLT.2014.7078629}
}
- Install and compile Kaldi.
- Untar gale_recipe.tar into the egs folder:
  tar xvf gale_recipe.tar -C kaldi-trunk/egs
- Modify run.sh:
  - Adjust the number of jobs to suit your setup (default: nJobs=120). The right value also depends on whether you use a queue or a local machine; see cmd.sh.
  - Change the data settings to point to the GALE database, for example: LDC2013S02_1=/alt/data/speech/LDC/LDC2013S02/gale_p2_arb_bc_speech_p1_d1
- Start run.sh
The script builds GMM, GMM+MPE, GMM+bMMI, SGMM+fMLLR, and SGMM+bMMI systems. The RESULTS file lists the WER obtained when the script is used with the QCRI pronunciation dictionary.
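Before starting a long training run, it can help to sanity-check the two settings edited in run.sh. The following is a minimal sketch, not part of the recipe: the helper names cap_jobs and check_dirs are hypothetical, and it assumes a local machine (not a queue), where running more jobs than there are CPU cores is rarely useful.

```shell
#!/usr/bin/env bash
# Pre-flight sketch for the GALE recipe.
# cap_jobs and check_dirs are hypothetical helpers, not part of Kaldi or the recipe.

# Cap the requested job count at the number of available cores (local runs only).
# $1 = requested jobs (the recipe default is 120), $2 = available cores.
cap_jobs() {
  local requested=$1 cores=$2
  if [ "$requested" -gt "$cores" ]; then
    echo "$cores"
  else
    echo "$requested"
  fi
}

# Succeed only if every path given as an argument is an existing directory.
# Missing directories are reported on stderr.
check_dirs() {
  local dir missing=0
  for dir in "$@"; do
    if [ ! -d "$dir" ]; then
      echo "missing data directory: $dir" >&2
      missing=1
    fi
  done
  return "$missing"
}

# Example use (assumes GNU coreutils nproc; the path is the example from the steps above):
#   cap_jobs 120 "$(nproc)"    # prints a value to consider for nJobs in run.sh
#   check_dirs /alt/data/speech/LDC/LDC2013S02/gale_p2_arb_bc_speech_p1_d1 && ./run.sh
```

The functions have no side effects, so they can be sourced and tried out without launching run.sh. On a queue-based setup the cap is probably not appropriate, since the scheduler decides how many jobs run at once.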