首页> 外文会议> >Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition

【24h】

Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition

机译：将Jacobian自适应与倒谱均值归一化和并行模型组合进行比较，以实现噪声鲁棒的语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, two techniques are researched for Jacobian adaptation (JA) in the presence of additive noise. Since the original concept of JA was presented only for static cepstral coefficients, the performance of JA is researched when it is extended to cover also the delta cepstrum. However, this extension or the original concept can not provide accurate recognition performance when the mismatch between the training and recognition environments is out of the linear range of JA. Hence, this problem can be alleviated to some extent by dividing JA into two steps. At first, the adaptation is done e.g. from clean to the target environment having "high" SNR level. After that, the new JA matrices are calculated and they are used in the second step to adapt the system to the lower target SNR level. Both of the above adaptation methods have been compared to cepstral mean normalization (CMN) and parallel model combination (PMC) in an isolated word recognition task having a vocabulary of 200 English words. The best performance was achieved with PMC but JA showed comparable performance to CMN and outperformed it when JA was done in two steps from SNR of 25 dB to 5 dB. The system was tested with the SpeechDat(II) database by adding noise recorded inside a car to the test set utterances at various SNR levels.

机译：在本文中，研究了在存在加性噪声的情况下针对雅可比适应（JA）的两种技术。由于JA的原始概念仅针对静态倒频谱系数提出，因此在扩展JA到同时涵盖三角倒频谱时，将对JA的性能进行研究。但是，当训练和识别环境之间的不匹配超出JA的线性范围时，此扩展或原始概念无法提供准确的识别性能。因此，通过将JA分为两个步骤，可以在某种程度上缓解此问题。首先，进行适应例如从干净到具有“高” SNR水平的目标环境。之后，将计算新的JA矩阵，并将其用于第二步，以使系统适应较低的目标SNR级别。在具有200个英语单词的词汇量的孤立单词识别任务中，已将上述两种自适应方法与倒谱平均归一化（CMN）和并行模型组合（PMC）进行了比较。 PMC可以达到最佳性能，但是JA表现出与CMN相当的性能，并且在从25 dB的SNR到5 dB的两个步骤中完成JA时，JA的性能均胜过其。通过在不同的SNR级别上将记录在车内的噪音添加到测试装置的发声中，使用SpeechDat（II）数据库对该系统进行了测试。

著录项

来源
《》|2002年|p.193-196|共4页
会议地点
作者
Parssinen; K.; Salmela; P.; Harju; M.; Kiss; I.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词
acoustic noise; Jacobian matrices; speech recognition; cepstral analysis; Jacobian adaptation; cepstral mean normalization; parallel model combination; additive noise; speech recognition; noise robustness; delta cepstrum; SNR level; isolated word rec;

机译：声音噪声雅可比矩阵;语音识别;倒谱分析雅可比适应性;倒谱均值归一化;并行模型组合;加性噪声;语音识别;噪声鲁棒性;三角倒谱信噪比等级;孤立的单词rec;

相似文献

外文文献
中文文献
专利

1. Noise-robust speech recognition by discriminative adaptation in parallel model combination [J] . Yong-Joo Chung Electronics Letters . 2000,第4期

机译：并行模型组合中判别自适应的鲁棒语音识别
2. The integration of principal component analysis and cepstral mean subtraction in parallel model combination for robust speech recognition [J] . Veisi H., Sameti H. Digital Signal Processing . 2011,第1期

机译：并行模型组合中主成分分析和倒谱均值减法的集成，可实现鲁棒的语音识别
3. Incorporating Codebook and Utterance Information in Cepstral Statistics Normalization Techniques for Robust Speech Recognition in Additive Noise Environments [J] . Jeih-weih HungWen-hsiang Tu Signal Processing Letters, IEEE . 2009,第6期

机译：在倒数统计归一化技术中整合密码本和话语信息，以在加性噪声环境中实现可靠的语音识别
4. Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition [C] . Airssinen, K., Salmela, . 2002

机译：将Jacobian自适应与倒谱均值归一化和并行模型组合进行比较，以实现噪声鲁棒的语音识别
5. The modified-mean cepstral mean normalization (MMCMN) method for channel-robust automatic speaker recognition. [D] . Garcia, Alvin A. 2002

机译：改进的均值倒谱均值归一化（MMCMN）方法用于声道鲁棒性自动说话人识别。
6. Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data [O] . Ayesha Pervaiz, Fawad Hussain, Huma Israr, 2020

机译：通过训练数据的噪声增强将噪声鲁棒性纳入语音命令识别中
7. Robust Speech Recognition by Model Adaptation and Normalization Using Pre-Observed Noise [O] . S. KOBASHIKAWA, S. TAKAHASHI 2008

机译：通过预观察噪声模型适应和标准化的强大语音识别

Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅