...
首页> 外文期刊>Cognitive Computation >Enhancing the Feature Extraction Process for Automatic Speech Recognition with Fractal Dimensions
【24h】

Enhancing the Feature Extraction Process for Automatic Speech Recognition with Fractal Dimensions

机译:利用分形维数增强特征提取过程以实现自动语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

Mel frequency cepstral coefficients (MFCCs) are a standard tool for automatic speech recognition (ASR), but they fail to capture part of the dynamics of speech. The nonlinear nature of speech suggests that extra information provided by some nonlinear features could be especially useful when training data are scarce or when the ASR task is very complex. In this paper, the Fractal Dimension of the observed time series is combined with the traditional MFCCs in the feature vector in order to enhance the performance of two different ASR systems. The first is a simple system of digit recognition in Chinese, with very few training examples, and the second is a large vocabulary ASR system for Broadcast News in Spanish.
机译:梅尔频率倒谱系数(MFCC)是用于自动语音识别(ASR)的标准工具,但是它们无法捕获语音动态的一部分。语音的非线性性质表明,当训练数据稀少或ASR任务非常复杂时,某些非线性功能提供的额外信息可能特别有用。在本文中,将观测到的时间序列的分形维数与传统MFCC结合在特征向量中,以增强两个不同的ASR系统的性能。第一个是中文的简单数字识别系统,几乎没有培训示例,第二个是西班牙语的广播新闻大词汇量ASR系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号