首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions
【24h】

Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions

机译:不匹配条件下说话人识别的单通道语音增强技术评估

获取原文

摘要

It is well known that MFCC based speaker identification (SID) systems easily break down under mismatched training and test conditions. In this paper, we report on a study that considers four different single-channel speech enhancement front-ends for robust SID under such conditions. Speech files from the YOHO database are corrupted with four types of noise including babble, car, factory, and white Gaussian at five SNR levels (0-20 dB), and processed using four speech enhancement techniques representing distinct classes of algorithms: spectral subtraction, statistical model-based, subspace, and Wiener filtering. Both processed and unprocessed files are submitted to a SID system trained on clean data. In addition, a new set of acoustic feature parameters based on Hilbert envelope of gam-matone filterbank outputs are proposed and evaluated for SID task. Experimental results indicate that: (i) depending on the noise type and SNR level, the enhancement front-ends may help or hurt SID performance, (ii) the proposed feature significantly achieves higher SID accuracy compared to MFCCs under mismatched conditions.
机译:众所周知,基于MFCC的说话人识别(SID)系统在不匹配的训练和测试条件下很容易崩溃。在本文中,我们报告了一项研究,该研究考虑了在这种情况下用于鲁棒SID的四个不同的单通道语音增强前端。 YOHO数据库中的语音文件受到五种SNR等级(0-20 dB)的四类噪声的破坏,包括胡言乱语,汽车噪声,工厂噪声和白高斯噪声,并使用代表四种不同算法类别的四种语音增强技术对其进行处理:频谱减法,基于统计模型的子空间和Wiener过滤。已处理的文件和未处理的文件都将提交到接受过干净数据培训的SID系统。此外,提出了基于gam-matone滤波器组输出的希尔伯特包络的一组新的声学特征参数,并对其进行了SID任务评估。实验结果表明:(i)取决于噪声类型和SNR级别,增强前端可能会帮助或损害SID性能;(ii)与不匹配条件下的MFCC相比,所提出的功能显着提高了SID的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号