首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Normal-to-shouted speech spectral mapping for speaker recognition under vocal effort mismatch
【24h】

Normal-to-shouted speech spectral mapping for speaker recognition under vocal effort mismatch

机译:语音不匹配下从正常到呼出的语音频谱映射,用于说话人识别

获取原文
获取外文期刊封面目录资料

摘要

Speaker recognition performance degrades substantially in case of vocal effort mismatch (e.g. shouted vs. normal speech) between test and enrollment utterances. Such a mismatch is often encountered, for example, in forensic speaker recognition. This paper introduces a novel spectral mapping method which, when employed jointly with a statistical mapping technique, converts the Mel-frequency band energies of normal speech towards their counterparts in shouted speech. The aim is to obtain more robust performance in speaker recognition by tackling vocal effort mismatch between enrollment and test utterances. The processing is performed on the speech signal before feature extraction. The proposed approach was evaluated by testing the performance of a state-of-the-art i-vector-based speaker recognition system with and without applying the spectral mapping processing to the enrollment data. The results show that pre-processing with the proposed approach results in considerable improvement in correct identification rates.
机译:在测试和录取语音之间的语音配音不匹配(例如喊声与正常语音)的情况下,说话者的识别性能会大大降低。例如,在法医说话者识别中经常会遇到这种不匹配。本文介绍了一种新颖的频谱映射方法,该方法与统计映射技术结合使用时,可以将正常语音的梅尔频带能量转换为大声语音中的对应频带。目的是通过解决注册和测试话语之间的语音不匹配,从而在说话者识别中获得更强大的性能。在特征提取之前对语音信号执行该处理。通过测试使用和不使用光谱映射处理到注册数据的最新的基于i-vector的说话人识别系统的性能,对提出的方法进行了评估。结果表明,采用提出的方法进行预处理可显着提高正确识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号