首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Revisiting VTLN Using Linear Transformation on Conventional MFCC
【24h】

Revisiting VTLN Using Linear Transformation on Conventional MFCC

机译:在常规MFCC上使用线性变换重新访问VTLN

获取原文

摘要

In this paper, we revisit the linear transformation for VTLN on conventional MFCC proposed by Sanand et al. in [1], using the idea of band-limited interpolation. The filter-bank is modified to include half-filters at zero and nyquist frequencies, as the full symmetric spectrum is required for performing band-limited interpolation. In this paper, we show that the filter-bank with half-filters does not affect the recognition performance on clean speech (also shown in [1]), but does affect the recognition performance on noisy speech. This motivated us to revisit the linear transformation for VTLN in [1] and propose modifications to undo the affect of half-filters during the feature extraction. We show through recognition experiments that the proposed modifications to the linear transformation have comparable performance as the conventional VTLN approach, still enabling us to perform VTLN using a linear transformation on conventional MFCC.
机译:在本文中,我们将回顾由Sanand等人提出的常规MFCC上VTLN的线性变换。在[1]中,使用了带限插值的思想。修改滤波器组以包括零频率和奈奎斯特频率的半滤波器,因为执行带限内插需要完整的对称频谱。在本文中,我们表明带有半滤波器的滤波器组不会影响干净语音的识别性能(也显示在[1]中),但是会影响嘈杂语音的识别性能。这促使我们重新研究[1]中VTLN的线性变换,并提出修改以消除特征提取期间半滤波器的影响。我们通过识别实验表明,对线性变换的拟议修改具有与常规VTLN方法相当的性能,仍然使我们能够在常规MFCC上使用线性变换来执行VTLN。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号