首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >Automatic speech recognition based on weighted minimum classification error (W-MCE) training method
【24h】

Automatic speech recognition based on weighted minimum classification error (W-MCE) training method

机译:基于加权最小分类误差(W-MCE)训练方法的自动语音识别

获取原文

摘要

The Bayes decision theory [1] is the foundation of the classical statistical pattern recognition approach. For most of pattern recognition problems, the Bayes decision theory is employed assuming that the system performance metric is defined as the simple error counting, which assigns identical cost to each recognition error. However, this prevalent performance metric is not desirable in many practical applications. For example, the cost of “recognition” error is required to be differentiated in keyword spotting systems. In this paper, we propose an extended framework for the speech recognition problem with non-uniform classification/recognition error cost. As the system performance metric, the recognition error is weighted based on the task objective. The Bayes decision theory is employed according to this performance metric and the decision rule with a non-uniform error cost function is derived. We argue that the minimum classification error (MCE) method, after appropriate generalization, is the most suitable training algorithm for the “optimal” classifier design to minimize the weighted error rate. We formulate the weighted MCE (W-MCE) algorithm based on the conventional MCE infrastructure by integrating the error cost and the recognition error count into one objective function. In the context of automatic speech recognition (ASR), we present a variety of training scenarios and weighting strategies under this extended framework. The experimental demonstration for large vocabulary continuous speech recognition is provided to support the effectiveness of our approach.
机译:贝叶斯决策理论[1]是古典统计模式识别方法的基础。对于大多数模式识别问题,采用贝叶斯决策理论假设系统性能度量被定义为简单的错误计数,这为每个识别错误分配相同的成本。然而,在许多实际应用中,这种普遍的性能度量是不可取的。例如,“识别”误差的成本是在关键字发现系统中区分。在本文中,我们提出了一种具有非统一分类/识别误差成本的语音识别问题的扩展框架。作为系统性能度量,基于任务目标加权识别错误。贝叶斯决策理论根据这种性能指标采用,导出了具有非均匀错误成本函数的决策规则。我们认为,在适当的泛化之后,最小分类误差(MCE)方法是最合适的“最优”分类器设计的训练算法,以最小化加权误差率。我们通过将误差成本与识别误差计数集成到一个目标函数来基于传统MCE基础设施的基于传统MCE基础设施的加权MCE(W-MCE)算法。在自动语音识别(ASR)的背景下,我们在这一扩展框架下提出了各种培训场景和加权策略。提供大型词汇连续语音识别的实验演示,以支持我们的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号