首页> 外文会议>International Conference on Contemporary Computing >Deep neural networks for kannada phoneme recognition
【24h】

Deep neural networks for kannada phoneme recognition

机译:用于卡纳达语音素识别的深度神经网络

获取原文

摘要

Deep neural network (DNN) based speech recognizers have recently replaced Gaussian Mixture Model (GMM) based systems as the state-of-the-art. Developing a phonetic engine and enhancing its performance can lead to significant improvement in Automatic Speech Recognition (ASR). However only a less work has been reported in developing Phonetic engine on large vocabulary Kannada speech corpus. In this paper, the comparative study of speech recognition baselines: HMM-GMM, HMM-ANN and HMM-DNN are analyzed. Our first set of experiments use the Kannada speech corpus, which contains continuous utterances recorded in three different modes namely read mode, lecture mode and conversation mode. Context independent phone modeling is carried out on the three baselines and evaluated on different modes of the corpus. Phone Error Rate is measured and compared on all the three baselines. Acoustic modeling using HMM-DNN baseline shows significant improvement of about 7-8 % over HMM-GMM and HMM-ANN baselines.
机译:基于深度神经网络(DNN)的语音识别器最近已取代基于高斯混合模型(GMM)的系统成为最新技术。开发语音引擎并增强其性能可以大大改善自动语音识别(ASR)。但是,在大型词汇卡纳达语语料库上开发语音引擎的工作报道较少。本文分析了语音识别基线的比较研究:HMM-GMM,HMM-ANN和HMM-DNN。我们的第一组实验使用了Kannada语音语料库,该语料库包含以三种不同模式(即朗读模式,演讲模式和对话模式)记录的连续语音。与上下文无关的电话建模在三个基线上进行,并在语料库的不同模式下进行评估。在所有三个基准上测量并比较“电话错误率”。使用HMM-DNN基线进行的声学建模显示,与HMM-GMM和HMM-ANN基线相比,显着改善了约7-8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号