首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >FILTERING ON THE TEMPORAL PROBABILITY SEQUENCE IN HISTOGRAM EQUALIZATION FOR ROBUST SPEECH RECOGNITION
【24h】

FILTERING ON THE TEMPORAL PROBABILITY SEQUENCE IN HISTOGRAM EQUALIZATION FOR ROBUST SPEECH RECOGNITION

机译:滤除鲁棒语音识别直方图均衡中的时间概率序列

获取原文

摘要

In this paper, we propose a filter-based histogram equalization (FHEQ) approach for robust speech recognition. The FHEQ approach first represents the original acoustic feature sequence with statistic probability. Then, a temporal average (TA) filter is applied to smooth the statistic probability sequence. Finally, the filtered statistic probability sequence is transformed to form a new acoustic feature stream. Filtering on statistic probability of a feature sequence is a novel concept that can incorporate the advantages of the conventional histogram equalization (HEQ) and temporal filtering techniques for better noise robustness. Our experimental results on the Aurora-2 and Aurora-4 tasks show that FHEQ outperforms the conventional cepstral mean subtraction (CMS), cepstral mean and variance normalization (CMVN), and HEQ. Furthermore, we conducted a comparison test on TA-HEQ and HEQ-TA, which apply a TA filter to smooth acoustic features before and after the HEQ processing, respectively. The test results show that FHEQ outperforms both TA-HEQ and HEQ-TA, suggesting that filtering in probability is more effective than filtering in acoustic feature.
机译:在本文中,我们提出了一种基于滤波器的直方图均衡(FHEQ)方法,用于鲁棒语音识别。 FHEQ方法首先表示具有统计概率的原始声学特征序列。然后,应用时间平均值(TA)滤波器以平滑统计概率序列。最后,转换过滤的统计概率序列以形成新的声学特征流。过滤特征序列的统计概率是一种新颖的概念,可以包含传统直方图均衡(HEQ)和时间过滤技术的优点,以获得更好的噪声鲁棒性。我们对极光-2和极光-4任务的实验结果表明,FHEQ优于传统的抗搏斯平均减法(CMS),抗康斯兰均值和方差标准化(CMVN)和HEQ。此外,我们对TA-HEQ和HEQ-TA进行了比较测试,其将TA过滤器应用于HEQ处理之前和之后的光滑声学特征。测试结果表明,FHEQ优于TA-HEQ和HEQ-TA,表明概率滤波比声学特征的滤波更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号