首页> 外文会议> >Robust Analysis and Weighting on MFCC Components for Speech Recognition and Speaker Identification
【24h】

Robust Analysis and Weighting on MFCC Components for Speech Recognition and Speaker Identification

机译:用于语音识别和说话人识别的MFCC组件的稳健分析和加权

获取原文

摘要

Mismatch between training and testing data is a major error source for both Automatic Speech Recognition (ASR) and Automatic Speaker Identification (ASI). In this paper, we first present a statistical weighting concept to exploit the unequal sensitivity of Mel-Frequency Cepstral Coefficients (MFCC) components to against the mismatch, such as ambient noise, recording equipment, transmission channels, and inter-speaker variations. We further design a new Kullback-Leibler (KL) Distance based weighting algorithm according to the proposed weighting concept to real-world problems in which the label information is often not provided. We examine our algorithm in ASR with mismatch by different speakers and also in ASI with mismatch by channel noises. Experimental results demonstrate the effectiveness and robustness of our proposed method.
机译:培训和测试数据之间的不匹配是自动语音识别(ASR)和自动说话者识别(ASI)的主要错误来源。在本文中,我们首先提出一种统计加权概念,以利用梅尔频率倒谱系数(MFCC)组件对不匹配(例如环境噪声,录音设备,传输通道和扬声器间变化)的不等灵敏度。根据提出的加权概念,我们针对现实世界中通常不提供标签信息的加权概念,进一步设计了一种新的基于Kullback-Leibler(KL)距离的加权算法。我们在不同说话者不匹配的ASR中以及在信道噪声不匹配的ASI中检查我们的算法。实验结果证明了我们提出的方法的有效性和鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号