首页> 外文会议>IEEE Workshop on Multimedia Signal Processing >A Novel Learning Method for Hidden Markov Models in Speech and Audio Processing
【24h】

A Novel Learning Method for Hidden Markov Models in Speech and Audio Processing

机译:一种语音和音频处理中隐马尔可夫模型的新型学习方法

获取原文

摘要

In recent years, various discriminative learning techniques for HMMs have consistently yielded significant benefits in speech recognition. In this paper, we present a novel optimization technique using the Minimum Classification Error (MCE) criterion to optimize the HMM parameters. Unlike Maximum Mutual Information training where an Extended Baum-Welch (EBW) algorithm exists to optimize its objective function, for MCE training the original EBW algorithm cannot be directly applied. In this work, we extend the original EBW algorithm and derive a novel method for MCE-based model parameter estimation. Compared with conventional gradient descent methods for MCE learning, the proposed method gives a solid theoretical basis, stable convergence, and it is well suited for the large-scale batch-mode training process essential in large-scale speech recognition and other pattern recognition applications. Evaluation experiments, including model training and speech recognition, are reported on both a small vocabulary task (TI-Digits) and a large vocabulary task (WSJ), where the effectiveness of the proposed method is demonstrated. We expect new future applications and success of this novel learning method in general pattern recognition and multimedia processing, in addition to speech and audio processing applications we present in this paper.
机译:近年来,各种歧视性学习技术对于汉姆公司始终产生了良好的良好益处。在本文中,我们使用最小分类误差(MCE)标准来提供一种新颖的优化技术来优化HMM参数。与扩展BAUM-WELCH(EBW)算法的最大相互信息培训不同,为了优化其目标函数,对于MCE训练,无法直接应用原始EBW算法。在这项工作中,我们扩展了原始EBW算法并导出了基于MCE的模型参数估计的新方法。与MCE学习的传统梯度下降方法相比,该方法提供了稳定的理论基础,稳定的收敛性,并且非常适用于大规模语音识别和其他模式识别应用中的大规模批量训练过程。在一个小词汇任务(Ti-Dige)和大型词汇表(WSJ)上报道了评估实验,包括模型训练和语音识别,其中证明了所提出的方法的有效性。除了在本文中存在的语音和音频处理应用之外,我们预计在一般模式识别和多媒体处理中的新的未来应用和成功。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号