...
首页> 外文期刊>Turkish Journal of Electrical Engineering and Computer Sciences >Scale-invariant MFCCs for speech/speaker recognition
【24h】

Scale-invariant MFCCs for speech/speaker recognition

机译:用于语音/扬声器识别的Scale-Invariant MFCC

获取原文
   

获取外文期刊封面封底 >>

       

摘要

The feature extraction process is a fundamental part of speech processing. Mel frequency cepstral coefficients (MFCCs) are the most commonly used feature types in the speech/speaker recognition literature. However, the MFCC framework may face numerical issues or dynamic range problems, which decreases their performance. A practical solution to these problems is adding a constant to filter-bank magnitudes before log compression, thus violating the scale-invariant property. In this work, a magnitude normalization and a multiplication constant are introduced to make the MFCCs scale-invariant and to avoid dynamic range expansion of nonspeech frames. Speaker verification experiments are conducted to show the effectiveness of the proposed scheme.
机译:特征提取过程是语音处理的基本部分。 MEL频率患者系数(MFCC)是语音/扬声器识别文献中最常用的特征类型。但是,MFCC框架可能面临数值问题或动态范围问题,这降低了它们的性能。对这些问题的实际解决方案在日志压缩之前将常数添加到滤波器库幅度,从而违反了规模不变的属性。在这项工作中,引入了幅度归一化和乘法常数,以使MFCCS鳞片不变,并避免NonsPeech帧的动态范围扩展。扬声器验证实验进行了展示拟议计划的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号