首页> 外文会议>International Conference on Signal Processing(ICSP'04) vol.1; 20040831-0904; Beijing(CN) >A COMPARISON OF RECONSTRUCTED PHASE SPACES AND CEPSTRAL COEFFICIENTS FOR MULTI-BAND PHONEME CLASSIFICATION

【24h】

A COMPARISON OF RECONSTRUCTED PHASE SPACES AND CEPSTRAL COEFFICIENTS FOR MULTI-BAND PHONEME CLASSIFICATION

机译：多频带语音分类的重构相位空间和倒谱系数的比较

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper examines the use of multi-band reconstructed phase spaces as models for phoneme classification. Sub-banding reconstructed phase spaces combines linear, frequency-based techniques with a nonlinear modeling approach to speech recognition. Experiments comparing the effects of filtering speech signals for both reconstructed phase space and traditional speech recognition approaches are presented. These experiments study the use of two non-overlapping sub-bands for isolated phoneme classification on the TIMIT corpus. It is shown that while classification accuracy using Mel frequency cepstral coefficients as features does not improve with sub-banding, the accuracy increases from 36.1% to 42.0% using sub-banded reconstructed phase spaces to model the phonemes.

机译：本文研究了使用多频带重构相空间作为音素分类模型。子带重构相空间将基于频率的线性技术与用于语音识别的非线性建模方法结合在一起。实验比较了在重构相空间和传统语音识别方法中对语音信号进行滤波的效果。这些实验研究了TIMIT语料库上两个非重叠子带对孤立音素分类的使用。结果表明，虽然使用Mel频率倒谱系数作为特征的分类精度不会随着子带的提高而提高，但使用子带重构相空间对音素进行建模的精度从36.1％提高到42.0％。

著录项

来源
《International Conference on Signal Processing(ICSP'04) vol.1; 20040831-0904; Beijing(CN) 》|2004年|P.634-637|共4页
会议地点 Beijing(CN)
作者
Kevin M. Indrebo; Richard J. Povinelli; Michael T. Johnson;
展开▼
作者单位

Department of Electrical and Computer Engineering Marquette University, Milwaukee, WI USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工） ;
关键词

相似文献

外文文献
中文文献
专利

1. Time-domain isolated phoneme classification using reconstructed phase spaces [J] . Johnson M.T., Povinelli R.J., Lindgren A.C., IEEE Transactions on Speech and Audio Proceessing . 2005 ,第4期

机译：使用重构相空间的时域隔离音素分类
2. Phoneme classification in reconstructed phase space with convolutional neural networks [J] . Wesley R. John, Khan A. Nayeemulla, Shahina A. Pattern recognition letters . 2020 ,第Jula期

机译：卷积神经网络重建阶段空间中的音素分类
3. MLP-based isolated phoneme classification using likelihood features extracted from reconstructed phase space [J] . Yasser Shekofteh, Farshad Almasganj, Ayoub Daliri Engineering Applications of Artificial Intelligence . 2015 ,第SEPa期

机译：使用从重构相空间提取的似然特征的基于MLP的孤立音素分类
4. A comparison of reconstructed phase spaces and cepstral coefficients for multi-band phoneme classification [C] . Indrebo, K.M., Povinelli, . 2004

机译：多频带音素分类的重构相空间和倒频谱系数的比较
5. Estimation of cepstral coefficients for robust speech recognition. [D] . Indrebo, Kevin M. 2008

机译：倒频谱系数的估计，用于鲁棒的语音识别。
6. Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features [O] . Ömer Eskidere, Ahmet Gürhanlı 2015

机译：基于多锥梅尔频率倒谱系数特征的语音障碍分类
7. A COMPARISON OF RECONSTRUCTED PHASE SPACES AND CEPSTRAL COEFFICIENTS FOR MULTI-BAND PHONEME CLASSIFICATION [O] . Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson 2008

机译：用于多波段频率分类的重构相空间和次幂系数的比较

A COMPARISON OF RECONSTRUCTED PHASE SPACES AND CEPSTRAL COEFFICIENTS FOR MULTI-BAND PHONEME CLASSIFICATION

摘要

著录项

相似文献

相关主题

期刊订阅