首页> 外文会议>International conference on advances in computing, communications and informatics >The effect of DC coefficient on mMFCC and mIMFCC for robust speaker recognition
【24h】

The effect of DC coefficient on mMFCC and mIMFCC for robust speaker recognition

机译:直流系数对mMFCC和mIMFCC的影响,以增强说话人识别能力

获取原文

摘要

In Speaker Recognition (SR) system, feature extraction is one of the crucial steps where the particular speaker related information is extracted. The state of the art algorithm for this purpose is Mel Frequency Cepstral Coefficient (MFCC), and its complementary feature, Inverted Mel Frequency Cepstral Coefficient (IMFCC). MFCC is based on mel scale and IMFCC is based on inverted mel (imel) scale. There are two another set of features we proposed as mMFCC and mIMFCC. In state-of-the-art system, we neglect the DC co-efficient of DCT from the feature set. In this paper, the DC coefficient and its effect on recognition accuracy on MFCC-IMFCC, as well as, mMFCC-mIMFCC has been studied. This has been verified on two standard different types of databases, like, YOHO for clean speech signal and POLYCOST for telephone based speech. The recognition accuracy of the proposed feature is better than their respective baseline feature when the DC coefficient was included, as well as, when it was not included.
机译:在说话者识别(SR)系统中,特征提取是提取特定说话者相关信息的关键步骤之一。为此目的,最先进的算法是梅尔频率倒谱系数(MFCC),以及它的互补功能,倒梅尔频率倒谱系数(IMFCC)。 MFCC基于梅尔标度,而IMFCC基于倒梅尔(imel)标度。我们提议了另外两组功能,即mMFCC和mIMFCC。在最新的系统中,我们从功能集中忽略了DCT的DC系数。本文研究了直流系数及其对MFCC-IMFCC以及mMFCC-mIMFCC的识别精度的影响。已经在两种标准的不同类型的数据库上对此进行了验证,例如,用于纯净语音信号的YOHO和用于基于电话的语音的POLYCOST。当包括DC系数以及不包括DC系数时,所提出特征的识别精度均优于其各自的基线特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号