International Conference on Signal Processing (ICSP'04), vol. 1; 2004-08-31 to 2004-09-04; Beijing (CN)

A COMPARISON OF RECONSTRUCTED PHASE SPACES AND CEPSTRAL COEFFICIENTS FOR MULTI-BAND PHONEME CLASSIFICATION

Abstract

This paper examines the use of multi-band reconstructed phase spaces as models for phoneme classification. Sub-banding reconstructed phase spaces combines linear, frequency-based techniques with a nonlinear modeling approach to speech recognition. Experiments comparing the effects of filtering speech signals for both reconstructed phase space and traditional speech recognition approaches are presented. These experiments study the use of two non-overlapping sub-bands for isolated phoneme classification on the TIMIT corpus. It is shown that while classification accuracy using Mel frequency cepstral coefficients as features does not improve with sub-banding, the accuracy increases from 36.1% to 42.0% using sub-banded reconstructed phase spaces to model the phonemes.
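The two building blocks named in the abstract, splitting a signal into two non-overlapping sub-bands and reconstructing a phase space from each band, can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the abstract does not state the embedding dimension, the time lag, the band edge, or the filter design, so `dim=3`, `lag=2`, the 4 kHz cutoff, and the brick-wall FFT filter are all assumptions, and the input is a synthetic two-tone signal rather than TIMIT speech. The function names `delay_embed` and `split_two_bands` are hypothetical.

```python
import numpy as np

def delay_embed(x, dim=3, lag=2):
    """Time-delay embedding.

    Row n of the result is [x[n], x[n + lag], ..., x[n + (dim - 1) * lag]].
    dim and lag are assumed values; the abstract does not specify them.
    """
    x = np.asarray(x, dtype=float)
    n_points = len(x) - (dim - 1) * lag
    if n_points <= 0:
        raise ValueError("signal too short for this dim and lag")
    return np.stack([x[i * lag : i * lag + n_points] for i in range(dim)], axis=1)

def split_two_bands(x, fs, cutoff_hz):
    """Split x into two non-overlapping bands by zeroing FFT bins.

    This ideal brick-wall filter is an assumption made for simplicity,
    not the filter used in the paper.
    """
    x = np.asarray(x, dtype=float)
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    low = np.fft.irfft(np.where(freqs < cutoff_hz, spectrum, 0), n=len(x))
    high = np.fft.irfft(np.where(freqs >= cutoff_hz, spectrum, 0), n=len(x))
    return low, high

# Synthetic stand-in for a speech segment: one tone in each band.
fs = 16000                       # TIMIT audio is sampled at 16 kHz
t = np.arange(400) / fs
signal = np.sin(2 * np.pi * 500 * t) + 0.5 * np.sin(2 * np.pi * 5000 * t)

low, high = split_two_bands(signal, fs, cutoff_hz=4000.0)  # assumed band edge
rps_low = delay_embed(low)       # one point cloud per sub-band
rps_high = delay_embed(high)
```

Each row of `rps_low` and `rps_high` is one point in the reconstructed phase space of that band. A classifier would then fit a statistical model to these point clouds for each phoneme; that modeling step is outside this sketch.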
