首页> 外文期刊>ETRI journal >Intra- and Inter-frame Features for Automatic Speech Recognition
【24h】

Intra- and Inter-frame Features for Automatic Speech Recognition

机译:自动语音识别的帧内和帧间功能

获取原文
           

摘要

In this paper, alternative dynamic features for speech recognition are proposed. The goal of this work is to improve speech recognition accuracy by deriving the representation of distinctive dynamic characteristics from a speech spectrum. This work was inspired by two temporal dynamics of a speech signal. One is the highly non-stationary nature of speech, and the other is the inter-frame change of a speech spectrum. We adopt the use of a sub-frame spectrum analyzer to capture very rapid spectral changes within a speech analysis frame. In addition, we attempt to measure spectral fluctuations of a more complex manner as opposed to traditional dynamic features such as delta or double-delta. To evaluate the proposed features, speech recognition tests over smartphone environments were conducted. The experimental results show that the feature streams simply combined with the proposed features are effective for an improvement in the recognition accuracy of a hidden Markov model–based speech recognizer.
机译:在本文中,提出了语音识别的替代动态特征。这项工作的目的是通过从语音频谱中得出独特的动态特征来提高语音识别的准确性。这项工作的灵感来自于语音信号的两个时间动态。一个是语音的高度非平稳性,另一个是语音频谱的帧间变化。我们采用子帧频谱分析仪来捕获语音分析帧内非常快速的频谱变化。此外,我们尝试以更复杂的方式测量频谱波动,这与传统的动态特征(例如增量或双增量)相反。为了评估建议的功能,在智能手机环境下进行了语音识别测试。实验结果表明,将特征流与提出的特征进行简单组合可有效提高基于隐马尔可夫模型的语音识别器的识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号