Conference: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference
Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition

Abstract

This paper proposes enhancing the complex-valued acoustic spectrograms of speech signals with histogram equalization (HEQ) to produce noise-robust features for recognition. The presented method extends our previous work on spectrogram enhancement and has two significant aspects. First, we process the real and imaginary parts of the acoustic spectrograms separately, so both the corresponding magnitude and phase components are enhanced implicitly. Second, we apply FIR filters to the intra-frame acoustic spectra to obtain their local structural statistics, which are then used to perform various types of HEQ on the acoustic spectrograms and so make the resulting speech features more robust. All experiments were carried out on the Aurora-2 database and task. The presented methods were tested against other well-known robustness methods, and the comparisons show that they improve the noise robustness of speech features.
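
The first aspect above, equalizing the real and imaginary parts separately, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a plain rank-based HEQ toward a standard normal reference, applied to each frequency bin's trajectory over frames, and it omits the FIR-filtered local statistics entirely. The function names `heq_to_gaussian` and `heq_complex_spectrogram`, the frame length, and the hop size are hypothetical choices made for illustration.

```python
import numpy as np
from statistics import NormalDist


def heq_to_gaussian(x):
    """Rank-based HEQ of a 1-D real sequence to a standard normal reference.

    Assumption: the reference distribution is N(0, 1); the abstract does
    not say which reference the paper uses.
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    ranks = np.argsort(np.argsort(x))   # rank of each sample, 0..n-1
    cdf = (ranks + 0.5) / n             # empirical CDF, strictly inside (0, 1)
    inv_cdf = NormalDist().inv_cdf      # inverse CDF of the reference
    return np.array([inv_cdf(p) for p in cdf])


def heq_complex_spectrogram(spec):
    """Equalize real and imaginary parts separately, per frequency bin.

    spec: complex array of shape (n_bins, n_frames).
    Assumption: statistics are gathered along the frame axis of each bin.
    """
    spec = np.asarray(spec)
    out = np.empty(spec.shape, dtype=complex)
    for k in range(spec.shape[0]):
        real_eq = heq_to_gaussian(spec[k].real)
        imag_eq = heq_to_gaussian(spec[k].imag)
        out[k] = real_eq + 1j * imag_eq
    return out


# Usage on a synthetic signal (a stand-in for a speech waveform).
rng = np.random.default_rng(0)
signal = rng.standard_normal(4000)
frame_len, hop = 256, 128               # hypothetical framing parameters
window = np.hanning(frame_len)
frames = np.stack([signal[i:i + frame_len] * window
                   for i in range(0, signal.size - frame_len + 1, hop)])
spec = np.fft.rfft(frames, axis=1).T    # shape (n_bins, n_frames)
enhanced = heq_complex_spectrogram(spec)
```

Because each part is mapped through a monotonic transform of its own, the ordering of values within a bin is preserved while both magnitude and phase of the recombined complex value change, which is the sense in which the two are enhanced implicitly.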