Robust combination of neural networks and hidden Markov models for speech recognition

Trentin E.; Gori M.

首页> 外文期刊>IEEE Transactions on Neural Networks >Robust combination of neural networks and hidden Markov models for speech recognition

【24h】

Robust combination of neural networks and hidden Markov models for speech recognition

机译：神经网络和隐马尔可夫模型的鲁棒组合，用于语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Acoustic modeling in state-of-the-art speech recognition systems usually relies on hidden Markov models (HMMs) with Gaussian emission densities. HMMs suffer from intrinsic limitations, mainly due to their arbitrary parametric assumption. Artificial neural networks (ANNs) appear to be a promising alternative in this respect, but they historically failed as a general solution to the acoustic modeling problem. This paper introduces algorithms based on a gradient-ascent technique for global training of a hybrid ANN/HMM system, in which the ANN is trained for estimating the emission probabilities of the states of the HMM. The approach is related to the major hybrid systems proposed by Bourlard and Morgan and by Bengio, with the aim of combining their benefits within a unified framework and to overcome their limitations. Several viable solutions to the "divergence problem"-that may arise when training is accomplished over the maximum-likelihood (ML) criterion-are proposed. Experimental results in speaker-independent, continuous speech recognition over Italian digit-strings validate the novel hybrid framework, allowing for improved recognition performance over HMMs with mixtures of Gaussian components, as well as over Bourlard and Morgan's paradigm. In particular, it is shown that the maximum a posteriori (MAP) version of the algorithm yields a 46.34% relative word error rate reduction with respect to standard HMMs.

机译：最新的语音识别系统中的声学建模通常依赖于具有高斯发射密度的隐马尔可夫模型（HMM）。 HMM受内在限制，主要是由于其任意的参数假设。在这方面，人工神经网络（ANN）似乎是一个有前途的替代方法，但从历史上看，它们不能作为声学建模问题的一般解决方案。本文介绍了一种基于梯度上升技术的混合ANN / HMM混合系统全局训练算法，其中对ANN进行训练以估计HMM状态的发射概率。该方法与Bourlard和Morgan以及Bengio提出的主要混合动力系统有关，目的是在统一框架内结合其优势并克服其局限性。针对“差异问题”，提出了几种可行的解决方案，这些解决方案可能是通过最大似然（ML）标准完成训练时出现的。在不依赖说话者的情况下，通过意大利数字字符串进行连续语音识别的实验结果验证了这种新型的混合框架，从而提高了混合高斯分量的HMM以及Bourlard和Morgan范例的识别性能。特别地，示出了算法的最大后验（MAP）版本相对于标准HMM产生46.34％的相对单词错误率降低。

著录项

来源
《IEEE Transactions on Neural Networks》 |2003年第6期|p.1519-1531|共13页
作者
Trentin E.; Gori M.;
展开▼
作者单位

Dipt. di Ingegneria dell'Infoimazione, Siena Univ., Italy;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
neural nets; hidden Markov models; speech recognition; Gaussian processes; maximum likelihood estimation; optimisation; gradient methods; learning (artificial intelligence); robust combination; neural network; hidden Markov model; speech recognition; Gaussian emission density; arbitrary parametric assumption; artificial neural network; gradient-ascent technique; emission probabilities; Bourlard and Morgan; divergence problem; maximum-likelihood criterion; maximum a posteriori; global optimization;

机译：神经网络;隐马尔可夫模型;语音识别;高斯过程;最大似然估计;优化;梯度法;学习（人工智能）;鲁棒组合;神经网络;隐马尔可夫模型;语音识别;高斯发射密度;任意参数假设;人工神经网络;梯度上升技术;排放概率;Bourlard和Morgan;散度问题;最大似然准则;最大后验概率;全局最优化;

相似文献

外文文献
中文文献
专利

1. Speech recognition algorithm based on neural network and hidden Markov model [J] . Zhao Jianhui, Gao Hongbo, Liu Yuchao, 中国邮电高校学报（英文版） . 2018,第004期

机译：基于神经网络和隐马尔可夫模型的语音识别算法
2. A HYBRID SPEECH RECOGNITION SYSTEM WITH HIDDEN MARKOV MODEL AND RADIAL BASIS FUNCTION NEURAL NETWORK [J] . Judith Justin, Ila Vennila American journal of applied sciences . 2013,第10期

机译：具有隐马尔可夫模型和径向基函数神经网络的混合语音识别系统。
3. A HYBRID SPEECH RECOGNITION SYSTEM WITH HIDDEN MARKOV MODEL AND RADIAL BASIS FUNCTION NEURAL NETWORK | Science Publications [J] . Ila Vennila, Judith Justin American journal of applied sciences . 2013,第10期

机译：具有隐马尔可夫模型的混合语音识别系统和径向基功能神经网络|科学出版物
4. Robust speech recognition using neural networks and hidden Markov models [C] . Lin Cong, Asghar, S. Information Technology: Coding and Computing, 2000. Proceedings. International Conference on . 2000

机译：使用神经网络和隐马尔可夫模型的鲁棒语音识别
5. Online Learning of Large Margin Hidden Markov Models for Automatic Speech Recognition. [D] . Cheng, Chih-Chieh. 2011

机译：在线学习大余量隐马尔可夫模型以进行自动语音识别。
6. Assessment of Dysarthria Using One-Word Speech Recognition with Hidden Markov Models [O] . Seung Hak Lee, Minje Kim, Han Gil Seo, 2019

机译：使用隐马尔可夫模型的单字语音识别评估构音障碍
7. ROBUST SPEECH RECOGNITION USING FUZZY MATRIX QUANTISATION, NEURAL NETWORKS AND HIDDEN MARKOV MODELS [O] . Cong L., Xydeas C.S. 1996

机译：基于模糊矩阵量化，神经网络和隐马尔可夫模型的鲁棒语音识别
8. Robust Speech Recognition Using Hidden Markov Models: Overview of a Research Program. [R] . Weinstein, C. J., Paul, D. B., Lippmann, R. P. 1990

机译：使用隐马尔可夫模型的鲁棒语音识别：研究计划概述。

Robust combination of neural networks and hidden Markov models for speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅