DNN-HMM Acoustic Modeling for Large Vocabulary Telugu Speech Recognition

Abstract

This paper focuses on the development of a large vocabulary Telugu speech database. Telugu is a low-resource language for which no standardized database exists for building automatic speech recognition (ASR) systems. The database, named the IIIT-H Telugu speech corpus, consists of neutral speech samples collected from 100 speakers for building a Telugu ASR system. The design of the speech and text corpora and the procedure followed for collecting the database are discussed in detail. Preliminary results are reported for ASR models built on this database. The architectural choices of deep neural networks (DNNs) play a crucial role in improving the performance of ASR systems. ASR systems trained with hybrid DNNs (DNN-HMM) with more hidden layers have shown better performance than conventional GMMs (GMM-HMM). The Kaldi toolkit is used to build the acoustic models required for the ASR system.
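
The abstract does not spell out how a hybrid DNN-HMM differs from a GMM-HMM, so the following is only a minimal sketch of the standard hybrid formulation, not code from the paper or from Kaldi. In a hybrid system the DNN outputs state posteriors p(s|x) for each frame, and these are divided by the state priors p(s) to obtain scaled likelihoods that stand in for the GMM emission scores during HMM decoding. The function names, logits, and priors below are hypothetical and chosen purely for illustration.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return [v - log_norm for v in logits]

def scaled_log_likelihoods(logits, state_priors):
    """Turn DNN outputs for one frame into HMM emission scores.

    Uses log p(x|s) = log p(s|x) - log p(s) + const; the constant
    log p(x) is dropped because it is the same for every state.
    """
    log_post = log_softmax(logits)
    return [lp - math.log(prior) for lp, prior in zip(log_post, state_priors)]

# Hypothetical frame: three HMM states with made-up logits and priors.
frame_logits = [2.0, 0.5, -1.0]
priors = [0.7, 0.2, 0.1]
scores = scaled_log_likelihoods(frame_logits, priors)
```

Dividing by the prior keeps frequent states from dominating the decoder merely because they were common in the training alignments.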

Bibliographic record

Source
International Conference on Mining Intelligence and Knowledge Exploration | 2017 | pp. 189-197 | 9 pages
Authors
Vishnu Vidyadhara Raju Vegesna; Krishna Gurugubelli; Hari Krishna Vydana; Bhargav Pulugandla; Manish Shrivastava; Anil Kumar Vuppala
Original format: PDF
Keywords
DNNs; HMMs; GMM; ASR; MFCCs

Similar literature

1. Building DNN acoustic models for large vocabulary speech recognition [J]. Andrew L. Maas, Peng Qi, Ziang Xie. Computer Speech and Language. 2017.
2. A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition [J]. Li Xiangang, Yang Yuning, Pang Zaihu. Neurocomputing. 2015.
3. Boosting HMM acoustic models in large vocabulary speech recognition [J]. Meyer C, Schramm H. Speech Communication. 2006, No. 5.
4. DNN-HMM Acoustic Modeling for Large Vocabulary Telugu Speech Recognition [C]. Vishnu Vidyadhara Raju Vegesna, Krishna Gurugubelli, Hari Krishna Vydana. International Conference on Mining Intelligence and Knowledge Exploration. 2017.
5. Statistical optimization of acoustic models for large vocabulary speech recognition [D]. Hu, Rusheng. 2006.
6. Intelligibility and Acoustic Characteristics of Clear and Conversational Speech in Telugu (A South Indian Dravidian Language) [O]. Naresh Durisala, S. G. R. Prakash, Arivudai Nambi. 2011.
7. Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition [O]. Soltau, Hagen; Liao, Hank; Sak, Hasim. 2016.
