首页> 外文会议>European Signal Processing Conference >WARPED DISCRETE COSINE TRANSFORM CEPSTRUM: A NEW FEATURE FOR SPEECH PROCESSING
【24h】

WARPED DISCRETE COSINE TRANSFORM CEPSTRUM: A NEW FEATURE FOR SPEECH PROCESSING

机译:翘曲离散余弦变换凯斯特鲁姆:语音处理的新功能

获取原文
获取外文期刊封面目录资料

摘要

In this paper, we propose a new feature for speech recognition and speaker identification application. The new feature is termed as warped-discrete cosine transform cepstrum (WDCTC). The feature is obtained by replacing the discrete cosine transform (DCT) by the warped discrete cosine transform (WDCT, [4]) in the discrete cosine transform cepstrum (DCTC [2]). The WDCT is implemented as a cascade of the DCT and IIR all-pass filters. We incorporate a nonlinear frequency-scale in DCTC which closely follows the barkscale. This is accomplished by setting the all-pass filter parameter using an expression given by Smith and Abel [5]. Performance of WDCTC is compared to mel-frequency cepstral coefficients (MFCC) in a speech recognition and speaker identification experiment. WDCTC outperforms MFCC in both noisy and noiseless conditions.
机译:在本文中,我们为语音识别和扬声器识别应用提出了一种新功能。新功能被称为翘曲离散余弦变换谱(WDCTC)。通过在离散余弦变换谱系中通过翘曲的离散余弦变换(WDCT,[4])替换离散余弦变换(DCT)来获得该特征(DCTC [2])。 WDCT实现为DCT和IIR All-Pass滤镜的级联。我们在DCTC中纳入了一个非线性频率级,这密切关注Barkscale。这是通过使用史密斯和abel [5]给出的表达式设置All-Pass滤波器参数来实现的。在语音识别和扬声器识别实验中将WDCTC的性能与Mel-频谱系统系数(MFCC)进行比较。 WDCTC在嘈杂和无噪声条件下表现出MFCC。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号