首页> 外文OA文献 >Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems
【2h】

Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

机译:混合aNN / Hmm自动语音识别系统高效训练的数据平衡

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Hybrid speech recognizers, where the estimation of the emission pdf of the states of Hidden Markov Models (HMMs), usually carried out using Gaussian Mixture Models (GMMs), is substituted by Artificial Neural Networks (ANNs) have several advantages over the classical systems. However, to obtain performance improvements, the computational requirements are heavily increased because of the need to train the ANN. Departing from the observation of the remarkable skewness of speech data, this paper proposes sifting out the training set and balancing the amount of samples per class. With this method the training time has been reduced 18 times while obtaining performances similar to or even better than those with the whole database, especially in noisy environments. However, the application of these reduced sets is not straightforward. To avoid the mismatch between training and testing conditions created by the modification of the distribution of the training data, a proper scaling of the a posteriori probabilities obtained and a resizing of the context window need to be performed as demonstrated in the paper.
机译:混合语音识别器(通常使用高斯混合模型(GMM)进行的隐马尔可夫模型(HMM)状态的pdf发射估计)被人工神经网络(ANN)替代,比传统系统具有多个优势。但是,为了获得性能改进,由于需要训练ANN,因此大大增加了计算需求。与对语音数据明显偏斜的观察不同,本文建议筛选出训练集并平衡每个班级的样本量。使用这种方法,训练时间减少了18倍,同时获得了与整个数据库相似甚至更好的性能,尤其是在嘈杂的环境中。但是,这些简化集的应用并不简单。为了避免由于修改训练数据的分布而导致的训练条件与测试条件之间的不匹配,如本文所述,需要对获得的后验概率进行适当的缩放,并调整上下文窗口的大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号