Sequential estimation with optimal forgetting for robust speech recognition

Afify M.; Siohan O.

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Sequential estimation with optimal forgetting for robust speech recognition

【24h】

Sequential estimation with optimal forgetting for robust speech recognition

机译：具有最佳遗忘的顺序估计，可实现可靠的语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mismatch is known to degrade the performance of speech recognition systems. In real life applications we often encounter nonstationary mismatch sources. A general way to compensate for slowly time varying mismatch is by using sequential algorithms with forgetting. The choice of the forgetting factor is usually performed empirically on some development data, and no optimality criterion is used. In this paper we introduce a framework for obtaining optimal forgetting factor. In sequential algorithms, a recursion is usually used to calculate the required parameters so as to optimize a certain performance measure. To obtain optimal forgetting, we develop a recursion to calculate the forgetting factor that optimizes the same performance criterion as done in the original recursion. When combined together the two recursions result in a sequential algorithm that simultaneously optimizes the desired parameters and the forgetting factor. The proposed method is applied in conjunction with a sequential noise estimation algorithm, but the same principle can be extended to a wide range of sequential algorithms. The algorithm is extensively evaluated for different speech recognition tasks: the 5K Wall Street Journal task corrupted by different types of artificially added noise, a command and digit database recorded in a noisy car environment, and a 20K Japanese broadcast news task corrupted by field noise. In all situations it was found that the sequential algorithm performs as well as or better than batch estimation. In addition, the proposed optimal forgetting algorithm performs as well as the best hand tuned forgetting factor. This results in a continuously adaptive compensation technique without the need of any manual adjustment.

机译：已知不匹配会降低语音识别系统的性能。在现实生活中，我们经常会遇到非平稳的失配源。补偿时变缓慢的不匹配的一般方法是使用带有遗忘的顺序算法。遗忘因子的选择通常是根据经验对一些开发数据进行的，并且不使用最优性标准。在本文中，我们介绍了获取最佳遗忘因子的框架。在顺序算法中，通常使用递归来计算所需参数，以优化某个性能指标。为了获得最佳的遗忘，我们开发了递归来计算遗忘因子，该遗忘因子可以优化与原始递归相同的性能标准。当两个递归组合在一起时，将产生一个顺序算法，可同时优化所需参数和遗忘因子。所提出的方法与顺序噪声估计算法结合使用，但是相同的原理可以扩展到广泛的顺序算法中。该算法针对不同的语音识别任务进行了广泛评估：5K《华尔街日报》任务因不同类型的人为添加的噪声而损坏，在嘈杂的汽车环境中记录的命令和数字数据库以及20K的日本广播新闻任务因现场噪声而损坏。在所有情况下，都发现顺序算法的性能优于或优于批估计。另外，所提出的最佳遗忘算法的性能与最佳的手动遗忘因子相同。这导致了一种连续自适应的补偿技术，而无需任何手动调整。

著录项

来源
《IEEE Transactions on Speech and Audio Proceeding》 |2004年第1期|p.19-26|共8页
作者
Afify M.; Siohan O.;
展开▼
作者单位

Fac. of Inf. & Comput., Cairo Univ., Giza, Egypt;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词
speech recognition; sequential estimation; optimisation; interference (signal); parameter estimation; speech recognition systems; nonstationary mismatch sources; slowly time varying mismatch; sequential algorithms; optimal forgetting factor; optimality criterion; 5K Wall Street Journal task; sequential noise estimation algorithm; command and digit database; noisy car environment; 20 K Japanese broadcast news task; stochastic approximation; noise compensation;

机译：语音识别;顺序估计;优化;干扰（信号）;参数估计;语音识别系统;非平稳失配源;慢时变失配;顺序算法;最佳遗忘因子;最优性准则;5K华尔街日报任务;顺序噪声估计算法;命令和数字数据库;嘈杂的汽车环境;20K日文广播新闻任务;随机逼近;噪声补偿;
入库时间 2022-08-18 00:13:04

相似文献

外文文献
中文文献
专利

1. Sequential estimation with optimal forgetting for robust speech recognition [J] . Afify M., Siohan O. IEEE Transactions on Speech and Audio Proceessing . 2004,第1期

机译：具有最佳遗忘的顺序估计，可实现可靠的语音识别
2. Combination of GMM-Based Speech Estimation Method and Temporal Domain SVD-Based Speech Enhancement for Noise Robust Speech Recognition [J] . Masakiyo Fujimoto, Yasuo Ariki Systems and Computers in Japan . 2007,第3期

机译：基于GMM的语音估计方法与基于时域SVD的语音增强相结合的噪声鲁棒语音识别
3. Noise robust speech recognition using GMM based speech estimation method [J] . Masakiyo Fujimoto, Yasuo Ariki 電子情報通信学会技術研究報告. 音声. Speech . 2002,第529期

机译：基于基于GMM的语音估计方法的噪声鲁棒语音识别
4. Sequential noise estimation with optimal forgetting for robust speech recognition [C] . Afify, M., Siohan, . 2001

机译：具有最佳遗忘功能的顺序噪声估计，可实现可靠的语音识别
5. ASR-driven binary mask estimation for robust automatic speech recognition [D] . Hartmann, William 2012

机译：ASR驱动的二进制掩码估计可实现强大的自动语音识别
6. Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions [O] . Baqun Zhang, Anastasios A. Tsiatis, Eric B. Laber, -1

机译：针对顺序治疗决策的最佳动态治疗方案的可靠估计
7. Noise Robust Automatic Speech Recognition with Adaptive Quantile Based Noise Estimation and Speech Band Emphasizing Filter Bank [O] . Casper Stork Bonde, Carina Graversen, Andreas Gregers Gregersen, 2008

机译：基于自适应分位数的噪声估计和语音带增强滤波器组的鲁棒自动语音识别

Sequential estimation with optimal forgetting for robust speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅