首页> 外文OA文献 >Robust speech recognition using model-based feature enhancement and harmonic decomposition in the missing data framework
【2h】

Robust speech recognition using model-based feature enhancement and harmonic decomposition in the missing data framework

机译:在缺失的数据框架中使用基于模型的特征增强和谐波分解来进行稳健的语音识别

摘要

Missing Data Theory (MDT) has shown to improve the robustnessof automatic speech recognition (ASR) systems. A crucialpart in a MDT-based recogniser is the computation of thereliability masks from noisy data. For each component of thefeature vector extracted from the noisy observation, the MissingData Detector (MDD) makes a decision about the presence ofspeech and noise. The components that are likely to be dominatedby the noise, are labelled as unreliable and their values willbe estimated from the reliable observations. In this paper, we exploitthe Model-Based Feature Enhancement (MBFE) techniquein the reliability decisions of the MDD. This technique makesuse of statistical models of clean speech and noise to generateestimates of the original speech and noise feature vectors. Decisioncriteria obtained by combining the MBFE-estimates withthe components derived from the harmonic decomposition of thenoisy speech signal showed a further increase in performance ofthe recognition system operating on noisy signals.
机译:丢失数据理论(MDT)已显示出提高了自动语音识别(ASR)系统的鲁棒性。基于MDT的识别器的关键部分是根据噪声数据计算可靠性掩码。对于从噪声观测中提取的特征向量的每个分量,MissingData Detector(MDD)都会做出关于语音和噪声存在的决策。可能由噪声支配的组件被标记为不可靠,并且将从可靠的观察值中估计其值。在本文中,我们在MDD的可靠性决策中采用了基于模型的特征增强(MBFE)技术。该技术利用干净语音和噪声的统计模型来生成原始语音和噪声特征向量的估计。通过将MBFE估计值与从有声语音信号的谐波分解中得出的分量相结合而获得的决策标准表明,在有声信号上运行的识别系统的性能进一步提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号