Deep Neural Network Based Continuous Speech Recognition for Serbian Using the Kaldi Toolkit

机译：使用Kaldi工具包的基于深度神经网络的塞尔维亚语连续语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a deep neural network (DNN) based large vocabulary continuous speech recognition (LVCSR) system for Serbian, developed using the open-source Kaldi speech recognition toolkit. The DNNs are initialized using stacked restricted Boltzmann machines (RBMs) and trained using cross-entropy as the objective function and the standard error backpropagation procedure in order to provide posterior probability estimates for the hidden Markov model (HMM) states. Emission densities of HMM states are represented as Gaussian mixture models (GMMs). The recipes were modified based on the particularities of the Serbian language in order to achieve the optimal results. A corpus of approximately 90 hours of speech (21000 utterances) is used for the training. The performances are compared for two different sets of utterances between the baseline GMM-HMM algorithm and various DNN settings.

机译：本文介绍了使用开源Kaldi语音识别工具包开发的基于深度神经网络（DNN）的塞尔维亚语大词汇量连续语音识别（LVCSR）系统。 DNN使用堆叠式受限Boltzmann机器（RBM）进行初始化，并使用交叉熵作为目标函数和标准误差反向传播过程进行训练，以便为隐马尔可夫模型（HMM）状态提供后验概率估计。 HMM状态的发射密度表示为高斯混合模型（GMM）。为了达到最佳效果，根据塞尔维亚语言的特殊性对配方进行了修改。大约90个小时的语音语料库（21000话语）用于训练。比较了基线GMM-HMM算法和各种DNN设置之间两组不同发音的性能。

著录项

来源
《International Conference on speech and computer》|2015年|186-192|共7页
会议地点
作者
Branislav Popovic; Stevan Ostrogonac; Edvin Pakoci; Niksa Jakovljevic; Vlado Delic;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Kaldi speech recognition toolkit; Continuous speech recognition; Deep neural networks; Serbian;

机译：Kaldi语音识别工具包;连续语音识别;深度神经网络;塞尔维亚;

相似文献

外文文献
中文文献
专利

1. DNN based continuous speech recognition system of Punjabi language on Kaldi toolkit [J] . Jyoti Guglani, A. N. Mishra International journal of speech technology . 2021,第1期

机译：基于DNN基于Kaldi Toolkit的Punjabi语言的连续语音识别系统
2. Continuous Punjabi speech recognition model based on Kaldi ASR toolkit [J] . Jyoti Guglani, A. N. Mishra International journal of speech technology . 2018,第2期

机译：基于Kaldi ASR工具包的旁遮普语连续语音识别模型
3. An Unsuper vised Adaptation Method for Deep Neural Network-based Large Vocabulary Continuous Speech Recognition [J] . Yeming Xiao, Yujing Si, Ji Xu, Journal of information and computational science . 2014,第14期

机译：基于深度神经网络的大词汇量连续语音识别的无监督自适应方法
4. Deep Neural Network Based Continuous Speech Recognition for Serbian Using the Kaldi Toolkit [C] . Branislav Popovic, Stevan Ostrogonac, Edvin Pakoci, Speech and Computer International Conference . 2015

机译：基于塞尔维亚使用Kaldi Toolkit的深度神经网络的连续语音识别
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O] . Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez 2012

机译：基于深度神经网络的自动语音识别的多分辨率语音分析：TIMIT实验
7. Convolutional neural networks-based continuous speech recognition using raw speech signal [O] . Dimitri Palaz, Mathew Magimai. -doss, Ronan Collobert 2015

机译：基于卷积神经网络的连续语音识别使用原始语音信号

Deep Neural Network Based Continuous Speech Recognition for Serbian Using the Kaldi Toolkit

摘要

著录项

相似文献

相关主题

期刊订阅