Annual conference of the International Speech Communication Association

Revisiting VTLN Using Linear Transformation on Conventional MFCC


Abstract

In this paper, we revisit the linear transformation for VTLN on conventional MFCC proposed by Sanand et al. in [1], using the idea of band-limited interpolation. The filter-bank is modified to include half-filters at zero and Nyquist frequencies, because band-limited interpolation requires the full symmetric spectrum. We show that the filter-bank with half-filters does not affect recognition performance on clean speech (as also shown in [1]), but does affect recognition performance on noisy speech. This motivated us to revisit the linear transformation for VTLN in [1] and to propose modifications that undo the effect of the half-filters during feature extraction. Recognition experiments show that the proposed modifications to the linear transformation give performance comparable to the conventional VTLN approach, while still enabling us to perform VTLN using a linear transformation on conventional MFCC.
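The abstract does not spell out how the half-filters are built, so the following is only a minimal sketch of one plausible reading: a triangular mel filter-bank in which two extra one-sided filters peak at 0 Hz and at the Nyquist frequency. The function name `mel_filterbank`, the `half_filters` flag, and the mel formula (the common 2595·log10(1 + f/700) convention) are assumptions made for illustration, not details taken from the paper or from [1].

```python
import numpy as np

def hz_to_mel(f):
    # Common mel-scale convention (an assumption; the paper may use another).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate, half_filters=False):
    """Triangular mel filter-bank; optionally adds half-filters at 0 Hz and Nyquist."""
    nyquist = sample_rate / 2.0
    # n_filters + 2 edge/center points, equally spaced on the mel scale.
    points = mel_to_hz(np.linspace(0.0, hz_to_mel(nyquist), n_filters + 2))
    points[0], points[-1] = 0.0, nyquist  # pin the endpoints against rounding
    freqs = np.linspace(0.0, nyquist, n_fft // 2 + 1)  # FFT bin frequencies

    # Conventional bank: centers at points 1..n_filters.
    # Hypothetical half-filter bank: also centers at point 0 and point n_filters + 1.
    centers = range(0, n_filters + 2) if half_filters else range(1, n_filters + 1)
    bank = np.zeros((len(centers), freqs.size))
    for row, c in enumerate(centers):
        if c > 0:  # rising slope (absent for the half-filter at 0 Hz)
            lo, mid = points[c - 1], points[c]
            rising = (freqs >= lo) & (freqs <= mid)
            bank[row, rising] = (freqs[rising] - lo) / (mid - lo)
        if c < n_filters + 1:  # falling slope (absent for the half-filter at Nyquist)
            mid, hi = points[c], points[c + 1]
            falling = (freqs >= mid) & (freqs <= hi)
            bank[row, falling] = (hi - freqs[falling]) / (hi - mid)
    return bank

conventional = mel_filterbank(20, 512, 16000)
with_half = mel_filterbank(20, 512, 16000, half_filters=True)
print(conventional.shape, with_half.shape)
```

In this sketch the conventional bank gives zero weight to the DC and Nyquist bins, whereas the half-filter variant weights them fully, so the filter-bank outputs cover the whole range from zero to Nyquist. How the paper undoes the effect of these extra filters during feature extraction is not described in the abstract and is not attempted here.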
