首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering
【24h】

What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering

机译:汉明窗之外还有什么新东西?坚固的MFCC,可通过多锥度识别说话人

获取原文

摘要

Usually the mel-frequency cepstral coefficients (MFCCs) are derived via Hamming windowed DFT spectrum. In this paper, we advocate to use a so-called multitaper method instead. Mul-titaper methods form a spectrum estimate using multiple window functions and frequency-domain averaging. Multitapers provide a robust spectrum estimate but have not received much attention in speech processing. Our speaker recognition experiment on NIST 2002 yields equal error rates (EERs) of 9.66 % (clean data) and 16.41 % (-10 dB SNR) for the conventional Hamming method and 8.13 % (clean data) and 14.63 % (-10 dB SNR) using multitapers. Multitapering is a simple and robust alternative to the Hamming window method.
机译:通常,梅尔频率倒谱系数(MFCC)是通过汉明窗DFT频谱导出的。在本文中,我们提倡使用所谓的多锥度方法。 Mul-titaper方法使用多个窗口函数和频域平均来形成频谱估计。多锥提供了可靠的频谱估计,但在语音处理中并未引起太多关注。我们在NIST 2002上的说话人识别实验产生的传统汉明方法的平均误码率(EER)为9.66%(原始数据)和16.41%(-10 dB SNR),以及8.13%(原始数据)和14.63%(-10 dB SNR) )使用多锥。多锥度是汉明窗法的一种简单而强大的替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号