首页> 外文期刊>Signal processing >Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition
【24h】

Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition

机译:独立矢量分析,然后基于HMM的功能增强,可实现可靠的语音识别

获取原文
获取原文并翻译 | 示例
       

摘要

This paper presents a feature-enhancement method that uses the outputs of independent vector analysis (IVA) for robust speech recognition. Although frequency-domain(FD) independent component analysis (ICA) can be successfully used in preprocessing of speech recognition because of its capability of blind source separation (BSS), the performance of the conventional ICA-based approaches is significantly degraded in under-determined cases. Assuming the target speaker is located relatively close to microphones, the blind spatial subtraction array (BSSA) (Takahashi et al.) tries to enhance target speech features by subtracting noise spectra estimated by FD ICA, even in the under-determined cases. Unfortunately, the ICA may not be proficient at target speech estimation and then may cause inaccurate noise spectrum estimation. To improve robustness of speech recognition with the inaccurate noise spectra, we introduce Bayesian inference to estimate clean speech features. For a further improvement, FD ICA and delay-and-sum beamforming in the BSSA are replaced with IVA and its target speech output because IVA can improve separation performance without the permutation problem. Experimental results show that the proposed method can further reduce the relative word error rates by 60.11% and 20.07% on average compared to the BSSA for the AURORA2 and DARPA Resource Management databases, respectively.
机译:本文提出了一种功能增强方法,该方法使用独立矢量分析(IVA)的输出进行鲁棒的语音识别。尽管由于其盲源分离(BSS)的能力,频域(FD)独立分量分析(ICA)可以成功地用于语音识别的预处理,但是基于ICA的传统方法的性能在未充分确定的情况下会大大降低案件。假设目标说话者位于相对靠近麦克风的位置,即使在不确定的情况下,盲空间减法阵列(BSSA)(Takahashi等人)也试图通过减去FD ICA估计的噪声频谱来增强目标语音特征。不幸的是,ICA可能不擅长目标语音估计,然后可能导致噪声频谱估计不准确。为了使用不准确的噪声频谱提高语音识别的鲁棒性,我们引入贝叶斯推理来估计干净的语音特征。为了进一步改进,将BSSA中的FD ICA和延迟与求和波束成形替换为IVA及其目标语音输出,因为IVA可以改善分离性能而不会出现置换问题。实验结果表明,与AURORA2数据库和DARPA资源管理数据库的BSSA相比,该方法可以将相对字错误率平均分别降低60.11%和20.07%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号