首页> 外文期刊>ETRI journal >Feature Extraction Based on Speech Attractors in the Reconstructed Phase Space for Automatic Speech Recognition Systems
【24h】

Feature Extraction Based on Speech Attractors in the Reconstructed Phase Space for Automatic Speech Recognition Systems

机译:自动语音识别系统重构相空间中基于语音吸引子的特征提取

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, a feature extraction (FE) method is proposed that is comparable to the traditional FE methods used in automatic speech recognition systems. Unlike the conventional spectral-based FE methods, the proposed method evaluates the similarities between an embedded speech signal and a set of predefined speech attractor models in the reconstructed phase space (RPS) domain. In the first step, a set of Gaussian mixture models is trained to represent the speech attractors in the RPS. Next, for a new input speech frame, a posterior-probability-based feature vector is evaluated, which represents the similarity between the embedded frame and the learned speech attractors. We conduct experiments for a speech recognition task utilizing a toolkit based on hidden Markov models, over FARSDAT, a well-known Persian speech corpus. Through the proposed FE method, we gain 3.11% absolute phoneme error rate improvement in comparison to the baseline system, which exploits the mel-frequency cepstral coefficient FE method.
机译:本文提出了一种特征提取(FE)方法,该方法可与自动语音识别系统中使用的传统FE方法相媲美。与传统的基于频谱的有限元方法不同,该方法在重建的相空间(RPS)域中评估嵌入式语音信号与一组预定义的语音吸引器模型之间的相似性。第一步,训练一组高斯混合模型以表示RPS中的语音吸引器。接下来,对于新的输入语音帧,将评估基于后验概率的特征向量,该特征向量表示嵌入帧与学习的语音吸引子之间的相似性。我们通过著名的波斯语语料库FARSDAT,使用基于隐马尔可夫模型的工具包进行语音识别任务的实验。通过提出的有限元方法,与采用梅尔频率倒谱系数有限元方法的基线系统相比,我们获得了3.11%的绝对音素错误率改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号