Time-domain isolated phoneme classification using reconstructed phase spaces

Johnson M.T.; Povinelli R.J.; Lindgren A.C.; Jinjin Ye; Xiaolin Liu; Indrebo K.M.

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Time-domain isolated phoneme classification using reconstructed phase spaces

【24h】

Time-domain isolated phoneme classification using reconstructed phase spaces

机译：使用重构相空间的时域隔离音素分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper introduces a novel time-domain approach to modeling and classifying speech phoneme waveforms. The approach is based on statistical models of reconstructed phase spaces, which offer significant theoretical benefits as representations that are known to be topologically equivalent to the state dynamics of the underlying production system. The lag and dimension parameters of the reconstruction process for speech are examined in detail, comparing common estimation heuristics for these parameters with corresponding maximum likelihood recognition accuracy over the TIMIT data set. Overall accuracies are compared with a Mel-frequency cepstral baseline system across five different phonetic classes within TIMIT, and a composite classifier using both cepstral and phase space features is developed. Results indicate that although the accuracy of the phase space approach by itself is still currently below that of baseline cepstral methods, a combined approach is capable of increasing speaker independent phoneme accuracy.

机译：本文介绍了一种新颖的时域方法来对语音音素波形进行建模和分类。该方法基于重构相空间的统计模型，该模型提供了显着的理论收益，作为已知的表示在拓扑上等同于基础生产系统的状态动态的表示形式。详细检查了语音重建过程的滞后和维度参数，将这些参数的通用估计启发式方法与TIMIT数据集上的相应最大似然识别精度进行了比较。将整体精度与TIMIT内五个不同语音分类的Mel频率倒谱基线系统进行比较，并开发了同时使用倒谱和相空间特征的复合分类器。结果表明，尽管相空间方法本身的准确性目前仍低于基线倒谱方法的准确性，但是组合方法能够提高说话者独立音素的准确性。

著录项

来源
《IEEE Transactions on Speech and Audio Proceessing》 |2005年第4期|p.458-466|共9页
作者
Johnson M.T.; Povinelli R.J.; Lindgren A.C.; Jinjin Ye; Xiaolin Liu; Indrebo K.M.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词
cepstral analysis; maximum likelihood estimation; pattern classification; phase space methods; speech recognition; time-domain analysis; composite classifier; maximum likelihood recognition; mel-frequency cepstral baseline system; nonlinear systems; reconstructed;

机译：倒谱分析;最大似然估计;模式分类;相空间方法;语音识别;时域分析;复合分类器;最大似然识别;梅尔频率倒谱基线系统;非线性系统;重构;

相似文献

外文文献
中文文献
专利

1. MLP-based isolated phoneme classification using likelihood features extracted from reconstructed phase space [J] . Yasser Shekofteh, Farshad Almasganj, Ayoub Daliri Engineering Applications of Artificial Intelligence . 2015,第SEPa期

机译：使用从重构相空间提取的似然特征的基于MLP的孤立音素分类
2. Phoneme classification in reconstructed phase space with convolutional neural networks [J] . Wesley R. John, Khan A. Nayeemulla, Shahina A. Pattern recognition letters . 2020,第Jula期

机译：卷积神经网络重建阶段空间中的音素分类
3. Statistical Models of Reconstructed Phase Spaces for Signal Classification [J] . Richard J. Povinelli, Michael T. Johnson, Andrew C. Lindgren, IEEE Transactions on Signal Processing . 2006,第6期

机译：用于信号分类的重构相空间统计模型
4. A comparison of reconstructed phase spaces and cepstral coefficients for multi-band phoneme classification [C] . Indrebo, K.M., Povinelli, . 2004

机译：多频带音素分类的重构相空间和倒频谱系数的比较
5. An Unsupervised Cluster: Learning Water Customer Behavior Using Variation of Information on a Reconstructed Phase Space [D] . Malinowski, Michele Rae Bizub. 2018

机译：一个无监督的集群：使用重构相空间上的信息变化来学习水用户行为
6. The Approach for Action Recognition Based on the Reconstructed Phase Spaces [O] . Hong-bin Tu, Li-min Xia -1

机译：基于重构相空间的动作识别方法
7. A COMPARISON OF RECONSTRUCTED PHASE SPACES AND CEPSTRAL COEFFICIENTS FOR MULTI-BAND PHONEME CLASSIFICATION [O] . Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson 2008

机译：用于多波段频率分类的重构相空间和次幂系数的比较
8. Iterated Class-Specific Subspaces for Speaker-Dependent Phoneme Classification [R] . Baggenstoss, P. M. 2008

机译：用于说话者相关音素分类的迭代类特定子空间

Time-domain isolated phoneme classification using reconstructed phase spaces

摘要

著录项

相似文献

相关主题

期刊订阅