Deep neural networks for kannada phoneme recognition

机译：用于卡纳达语音素识别的深度神经网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep neural network (DNN) based speech recognizers have recently replaced Gaussian Mixture Model (GMM) based systems as the state-of-the-art. Developing a phonetic engine and enhancing its performance can lead to significant improvement in Automatic Speech Recognition (ASR). However only a less work has been reported in developing Phonetic engine on large vocabulary Kannada speech corpus. In this paper, the comparative study of speech recognition baselines: HMM-GMM, HMM-ANN and HMM-DNN are analyzed. Our first set of experiments use the Kannada speech corpus, which contains continuous utterances recorded in three different modes namely read mode, lecture mode and conversation mode. Context independent phone modeling is carried out on the three baselines and evaluated on different modes of the corpus. Phone Error Rate is measured and compared on all the three baselines. Acoustic modeling using HMM-DNN baseline shows significant improvement of about 7-8 % over HMM-GMM and HMM-ANN baselines.

机译：基于深度神经网络（DNN）的语音识别器最近已取代基于高斯混合模型（GMM）的系统成为最新技术。开发语音引擎并增强其性能可以大大改善自动语音识别（ASR）。但是，在大型词汇卡纳达语语料库上开发语音引擎的工作报道较少。本文分析了语音识别基线的比较研究：HMM-GMM，HMM-ANN和HMM-DNN。我们的第一组实验使用了Kannada语音语料库，该语料库包含以三种不同模式（即朗读模式，演讲模式和对话模式）记录的连续语音。与上下文无关的电话建模在三个基线上进行，并在语料库的不同模式下进行评估。在所有三个基准上测量并比较“电话错误率”。使用HMM-DNN基线进行的声学建模显示，与HMM-GMM和HMM-ANN基线相比，显着改善了约7-8％。

著录项

来源
《International Conference on Contemporary Computing》|2016年|1-6|共6页
会议地点
作者
R Pradeep; K. Sreenivasa Rao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Hidden Markov models; Speech; Neural networks; Speech recognition; Training; Engines; Stacking;

机译：隐马尔可夫模型;语音;神经网络;语音识别;训练;引擎;堆叠;

相似文献

外文文献
中文文献
专利

1. Handwritten Kannada numerals recognition using deep learning convolution neural network (DCNN) classifier [J] . Vishweshwrayya C. Hallur, R. S. Hegadi CSI Transactions on ICT . 2020,第3期

机译：手写的Kannada数字识别使用深度学习卷积神经网络（DCNN）分类器
2. Online phoneme recognition using multi-layer perceptron networks combined with recurrent non-linear autoregressive neural networks with exogenous inputs [J] . Bonilla Cardona Diana A., Nedjah Nadia, Mourelle Luiza M. Neurocomputing . 2017,第nova22期

机译：使用多层感知器网络结合具有外部输入的递归非线性自回归神经网络的在线音素识别
3. Deep belief networks for phoneme recognition in continuous Tamil speech-an analysis [J] . Laxmi Sree Baskaran Raguram, Vijaya Madhaya Shanmugam Revue du Cethedec . 2017,第3a4期

机译：泰米尔语连续语音中用于音素识别的深度信念网络-分析
4. Deep neural networks for kannada phoneme recognition [C] . R Pradeep, K. Sreenivasa Rao International Conference on Contemporary Computing . 2016

机译：kannada音素识别的深神经网络
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Deep Sensing: Inertial and Ambient Sensing for Activity Context Recognition Using Deep Convolutional Neural Networks [O] . Abayomi Otebolaku, Timibloudi Enamamu, Ali Alfoudi, 2020

机译：深度传感：使用深卷积神经网络的活动语境识别惯性和环境感测
7. Cascade Deep Neural Networks Classifiers for Phonemes Recognition [O] . Mohammad Smit, Abdel-Nasser Al-Assimi 2020

机译：Cascade深神经网络的音素识别分类

Deep neural networks for kannada phoneme recognition

摘要

著录项

相似文献

相关主题

期刊订阅