首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Sequential estimation with optimal forgetting for robust speech recognition
【24h】

Sequential estimation with optimal forgetting for robust speech recognition

机译:具有最佳遗忘的顺序估计,可实现可靠的语音识别

获取原文
获取原文并翻译 | 示例
       

摘要

Mismatch is known to degrade the performance of speech recognition systems. In real life applications we often encounter nonstationary mismatch sources. A general way to compensate for slowly time varying mismatch is by using sequential algorithms with forgetting. The choice of the forgetting factor is usually performed empirically on some development data, and no optimality criterion is used. In this paper we introduce a framework for obtaining optimal forgetting factor. In sequential algorithms, a recursion is usually used to calculate the required parameters so as to optimize a certain performance measure. To obtain optimal forgetting, we develop a recursion to calculate the forgetting factor that optimizes the same performance criterion as done in the original recursion. When combined together the two recursions result in a sequential algorithm that simultaneously optimizes the desired parameters and the forgetting factor. The proposed method is applied in conjunction with a sequential noise estimation algorithm, but the same principle can be extended to a wide range of sequential algorithms. The algorithm is extensively evaluated for different speech recognition tasks: the 5K Wall Street Journal task corrupted by different types of artificially added noise, a command and digit database recorded in a noisy car environment, and a 20K Japanese broadcast news task corrupted by field noise. In all situations it was found that the sequential algorithm performs as well as or better than batch estimation. In addition, the proposed optimal forgetting algorithm performs as well as the best hand tuned forgetting factor. This results in a continuously adaptive compensation technique without the need of any manual adjustment.
机译:已知不匹配会降低语音识别系统的性能。在现实生活中,我们经常会遇到非平稳的失配源。补偿时变缓慢的不匹配的一般方法是使用带有遗忘的顺序算法。遗忘因子的选择通常是根据经验对一些开发数据进行的,并且不使用最优性标准。在本文中,我们介绍了获取最佳遗忘因子的框架。在顺序算法中,通常使用递归来计算所需参数,以优化某个性能指标。为了获得最佳的遗忘,我们开发了递归来计算遗忘因子,该遗忘因子可以优化与原始递归相同的性能标准。当两个递归组合在一起时,将产生一个顺序算法,可同时优化所需参数和遗忘因子。所提出的方法与顺序噪声估计算法结合使用,但是相同的原理可以扩展到广泛的顺序算法中。该算法针对不同的语音识别任务进行了广泛评估:5K《华尔街日报》任务因不同类型的人为添加的噪声而损坏,在嘈杂的汽车环境中记录的命令和数字数据库以及20K的日本广播新闻任务因现场噪声而损坏。在所有情况下,都发现顺序算法的性能优于或优于批估计。另外,所提出的最佳遗忘算法的性能与最佳的手动遗忘因子相同。这导致了一种连续自适应的补偿技术,而无需任何手动调整。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号