首页> 外文会议>International Conference on Intelligent Computation Technology and Automation >An Improved Tibetan Lhasa Speech Recognition Method Based on Deep Neural Network
【24h】

An Improved Tibetan Lhasa Speech Recognition Method Based on Deep Neural Network

机译:基于深神经网络的藏族语音识别方法改进

获取原文

摘要

Deep Neural Networks (DNN) are the dominant technique widely used in English and Chinese speech recognition currently. However, Tibetan speech recognition research starts late and mainly uses Hidden Markov Model (HMM). In this paper, We show a better method of replacing Gaussian Mixture Models (GMM) by DNN to Tibetan Lhasa dialect speech recognition system. The system contains seven layers of features and a very large number of parameters. The Tibetan Lhasa dialect speech recognition system is implemented by combining the ability of sequence modeling with HMM. DNN has recently proved to be very effective for a variety of machine learning problems. From the results, we can find that the word error rate(WER) of Tibetan speech recognition is reduced from 41.7% to 28.3% compared with the basic HMM.
机译:深度神经网络(DNN)是目前广泛应用于英语和中文语音识别的主要技术。然而,西藏语音识别研究开始迟到,主要使用隐马尔可夫模型(HMM)。在本文中,我们展示了DNN将高斯混合模型(GMM)更换更好的方法,以TIBETAN LHASA方言语音识别系统。该系统包含七层特征和非常大量的参数。通过组合序列建模与HMM的能力来实现西藏LHASA方言语音识别系统。 DNN最近证明对各种机器学习问题非常有效。从结果中,与基本嗯,我们可以发现藏语语音识别的错误率(WER)从41.7 %降至28.3 %。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号