首页> 外文会议> >Hierarchical subband linear predictive cepstral (HSLPC) features for HMM-based speech recognition
【24h】

Hierarchical subband linear predictive cepstral (HSLPC) features for HMM-based speech recognition

机译:用于基于HMM的语音识别的分层子带线性预测倒谱(HSLPC)功能

获取原文

摘要

A new approach for linear prediction (LP) analysis is explored, where predictor can be computed from a mel-warped subband-based autocorrelation functions obtained from the power spectrum. For spectral representation a set of multi-resolution cepstral features are proposed. The general idea is to divide up the full frequency-band into several subbands, perform the IDFT on the mel power spectrum for each subband, followed by Durbin's algorithm and the standard conversion from LP to cepstral coefficients. This approach can be extended to several levels of different resolutions. Multi-resolution feature vectors, formed by concatenation of the subband cepstral features into an extended feature vector, are shown to yield better performance than the conventional mel-warped LPCCs over the full voice-bandwidth for a connected digit recognition task.
机译:探索了一种用于线性预测(LP)分析的新方法,其中可以根据从功率谱获得的基于mel翘曲子带的自相关函数来计算预测因子。对于频谱表示,提出了一组多分辨率倒谱特征。总体思路是将整个频带划分为几个子带,对每个子带的mel功率谱执行IDFT,然后执行Durbin算法和从LP到倒频谱系数的标准转换。该方法可以扩展到不同分辨率的多个级别。通过将子带倒谱特征串联到扩展特征向量中形成的多分辨率特征向量,在连接的数字识别任务的整个语音带宽上,比常规的mel-warped LPCC具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号