首页> 外文会议>International conference on mining intelligence and knowledge exploration >DNN-HMM Acoustic Modeling for Large Vocabulary Telugu Speech Recognition
【24h】

DNN-HMM Acoustic Modeling for Large Vocabulary Telugu Speech Recognition

机译:大词汇泰卢固语语音识别的DNN-HMM声学建模

获取原文

摘要

The main focus of this paper is towards the development of a large vocabulary Telugu speech database. Telugu is a low resource language where there exists no standardized database for building the speech recognition system (ASR). The database consists of neutral speech samples collected from 100 speakers for building the Telugu ASR system and it was named as IIIT-H Telugu speech corpus. The speech and text corpus design and the procedure followed for the collection of the database have been discussed in detail. The preliminary ASR system results for the models built in this database are reported. The architectural choices of deep neural networks (DNNs) play a crucial role in improving the performance of ASR systems. ASR trained with hybrid DNNs (DNN-HMM) with more hidden layers have shown better performance over the conventional GMMs (GMM-HMM). Kaldi tool kit is used for building the acoustic models required for the ASR system.
机译:本文的主要重点是开发大型词汇泰卢固语语音数据库。泰卢固语是一种资源不足的语言,其中没有用于构建语音识别系统(ASR)的标准化数据库。该数据库包括从100位演讲者那里收集的用于建立泰卢固语ASR系统的中性语音样本,该数据库被命名为IIIT-H泰卢固语语音语料库。详细讨论了语音和文本语料库的设计以及收集数据库所遵循的过程。报告了在此数据库中构建的模型的初步ASR系统结果。深度神经网络(DNN)的体系结构选择在提高ASR系统的性能方面起着至关重要的作用。与具有更多隐藏层的混合DNN(DNN-HMM)一起训练的ASR与传统GMM(GMM-HMM)相比,表现出更好的性能。 Kaldi工具套件用于构建ASR系统所需的声学模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号