首页> 外文会议>10th IEEE International Symposium on Signal Processing and Information Technology >Effect of MFCC normalization on vector quantization based speaker identification
【24h】

Effect of MFCC normalization on vector quantization based speaker identification

机译:MFCC归一化对基于矢量量化的说话人识别的影响

获取原文

摘要

Mel Frequency Cepstral Coefficients (MFCC) are widely used in speech recognition and speaker identification. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is creating delta and delta-delta coefficients and append them to MFCC to create feature vector. Another pre-processing is coefficients mean normalization. In this paper, the effect of these two processes on the accuracy of a Vector Quantization (VQ) speaker identification system is compared. Additionally, it is shown that coefficient variance normalization, which is less common, can improve the accuracy.
机译:梅尔频率倒谱系数(MFCC)被广泛用于语音识别和说话人识别。 MFCC功能通常在用于识别之前先进行预处理。这些预处理之一是创建delta和delta-delta系数,并将它们附加到MFCC以创建特征向量。另一个预处理是系数均值归一化。在本文中,比较了这两个过程对矢量量化(VQ)说话人识别系统准确性的影响。另外,还表明,较少见的系数方差归一化可以提高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号