Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

Garcia-Moral A. I.; Solera-Urena R.; Pelaez-Moreno C.; Diaz-de-Maria F.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

【24h】

Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

机译：数据平衡可有效训练混合ANN / HMM自动语音识别系统

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Hybrid speech recognizers, where the estimation of the emission pdf of the states of hidden Markov models (HMMs), usually carried out using Gaussian mixture models (GMMs), is substituted by artificial neural networks (ANNs) have several advantages over the classical systems. However, to obtain performance improvements, the computational requirements are heavily increased because of the need to train the ANN. Departing from the observation of the remarkable skewness of speech data, this paper proposes sifting out the training set and balancing the amount of samples per class. With this method, the training time has been reduced 18 times while obtaining performances similar to or even better than those with the whole database, especially in noisy environments. However, the application of these reduced sets is not straightforward. To avoid the mismatch between training and testing conditions created by the modification of the distribution of the training data, a proper scaling of the a posteriori probabilities obtained and a resizing of the context window need to be performed as demonstrated in this paper.

机译：混合语音识别器通常使用高斯混合模型（GMM）进行隐马尔可夫模型（HMM）的状态的pdf估计，与传统系统相比，人工神经网络（ANN）替代了混合语音识别器。但是，为了获得性能改进，由于需要训练ANN，因此大大增加了计算需求。与对语音数据明显偏斜的观察不同，本文建议筛选出训练集并平衡每个班级的样本量。使用这种方法，训练时间减少了18倍，同时获得了与整个数据库相似甚至更好的性能，尤其是在嘈杂的环境中。但是，这些简化集的应用并不简单。为了避免由于修改训练数据的分布而导致的训练条件与测试条件之间的不匹配，如本文所示，需要对获得的后验概率进行适当的缩放，并调整上下文窗口的大小。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2011年第3期|p.468-481|共14页
作者
Garcia-Moral A. I.; Solera-Urena R.; Pelaez-Moreno C.; Diaz-de-Maria F.;
展开▼
作者单位

Signal Processing and Communications Department, University Carlos III Madrid, Leganés, Spain;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
ANN/HMM; Active learning; MLP/HMM; additive noise; artificial neural networks (ANNs); hidden Markov models (HMMs); hybrid automatic speech recognition (ASR); machine learning; multilayer perceptrons (MLPs); robust ASR;

机译：ANN / HMM;主动学习;MLP / HMM;加性噪声;人工神经网络（ANN）;隐马尔可夫模型（HMM）;混合自动语音识别（ASR）;机器学习;多层感知器（MLP）;稳健的ASR;

相似文献

外文文献
中文文献
专利

1. Discriminative training of HMMs for automatic speech recognition: A survey [J] . Hui Jiang Computer speech and language . 2010,第4期

机译：用于自动语音识别的HMM的歧视性培训：一项调查
2. A survey of hybrid ANN/HMM models for automatic speech recognition [J] . Edmondo Trentin, Marco Gori Neurocomputing . 2001,第期

机译：用于自动语音识别的混合ANN / HMM模型的调查
3. Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system [J] . Pujol P., Pol S., Nadeu C., IEEE Transactions on Speech and Audio Proceessing . 2005,第1期

机译：HMM / MLP和HMM / GMM混合语音识别系统中功能的比较和组合
4. A new training algorithm for hybrid HMM/ANN speech recognition systems [C] . Bourlard Herve, Konig Yochai, Morgan Nelson, European Signal Processing Conference . 1996

机译：HMM / ANN混合语音识别系统的新训练算法
5. Development and optimization of a HMM-ANN hybrid structure for robust recognition of impaired speech signals [D] . Polur, Prasad Babu Deenadayalan 2005

机译：HMM-ANN混合结构的开发和优化，可用于语音信号的鲁棒识别
6. Estimation of Phoneme-Specific HMM Topologies for the Automatic Recognition of Dysarthric Speech [O] . Santiago-Omar Caballero-Morales 2013

机译：语音异常自动识别的音素特定HMM拓扑估计
7. Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems [O] . García-Moral Ana I., Solera Ureña R., Peláez-Moreno Carmen, 2011

机译：混合aNN / Hmm自动语音识别系统高效训练的数据平衡

Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

摘要

著录项

相似文献

相关主题

期刊订阅