首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems
【24h】

Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

机译:数据平衡可有效训练混合ANN / HMM自动语音识别系统

获取原文
获取原文并翻译 | 示例
           

摘要

Hybrid speech recognizers, where the estimation of the emission pdf of the states of hidden Markov models (HMMs), usually carried out using Gaussian mixture models (GMMs), is substituted by artificial neural networks (ANNs) have several advantages over the classical systems. However, to obtain performance improvements, the computational requirements are heavily increased because of the need to train the ANN. Departing from the observation of the remarkable skewness of speech data, this paper proposes sifting out the training set and balancing the amount of samples per class. With this method, the training time has been reduced 18 times while obtaining performances similar to or even better than those with the whole database, especially in noisy environments. However, the application of these reduced sets is not straightforward. To avoid the mismatch between training and testing conditions created by the modification of the distribution of the training data, a proper scaling of the a posteriori probabilities obtained and a resizing of the context window need to be performed as demonstrated in this paper.
机译:混合语音识别器通常使用高斯混合模型(GMM)进行隐马尔可夫模型(HMM)的状态的pdf估计,与传统系统相比,人工神经网络(ANN)替代了混合语音识别器。但是,为了获得性能改进,由于需要训练ANN,因此大大增加了计算需求。与对语音数据明显偏斜的观察不同,本文建议筛选出训练集并平衡每个班级的样本量。使用这种方法,训练时间减少了18倍,同时获得了与整个数据库相似甚至更好的性能,尤其是在嘈杂的环境中。但是,这些简化集的应用并不简单。为了避免由于修改训练数据的分布而导致的训练条件与测试条件之间的不匹配,如本文所示,需要对获得的后验概率进行适当的缩放,并调整上下文窗口的大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号