Enhancing the Feature Extraction Process for Automatic Speech Recognition with Fractal Dimensions

Aitzol Ezeiza; Karmele López de Ipiña; Carmen Hernández; Nora Barroso

首页> 外文期刊>Cognitive Computation >Enhancing the Feature Extraction Process for Automatic Speech Recognition with Fractal Dimensions

【24h】

Enhancing the Feature Extraction Process for Automatic Speech Recognition with Fractal Dimensions

机译：利用分形维数增强特征提取过程以实现自动语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mel frequency cepstral coefficients (MFCCs) are a standard tool for automatic speech recognition (ASR), but they fail to capture part of the dynamics of speech. The nonlinear nature of speech suggests that extra information provided by some nonlinear features could be especially useful when training data are scarce or when the ASR task is very complex. In this paper, the Fractal Dimension of the observed time series is combined with the traditional MFCCs in the feature vector in order to enhance the performance of two different ASR systems. The first is a simple system of digit recognition in Chinese, with very few training examples, and the second is a large vocabulary ASR system for Broadcast News in Spanish.

机译：梅尔频率倒谱系数（MFCC）是用于自动语音识别（ASR）的标准工具，但是它们无法捕获语音动态的一部分。语音的非线性性质表明，当训练数据稀少或ASR任务非常复杂时，某些非线性功能提供的额外信息可能特别有用。在本文中，将观测到的时间序列的分形维数与传统MFCC结合在特征向量中，以增强两个不同的ASR系统的性能。第一个是中文的简单数字识别系统，几乎没有培训示例，第二个是西班牙语的广播新闻大词汇量ASR系统。

著录项

来源
《Cognitive Computation》 |2013年第4期|545-550|共6页
作者
Aitzol Ezeiza; Karmele López de Ipiña; Carmen Hernández; Nora Barroso;
展开▼
作者单位

Department of Systems Engineering and Automation University of the Basque Country UPV/EHU">(1);

Department of Systems Engineering and Automation University of the Basque Country UPV/EHU">(1);

Department of Computer Science and Artificial Intelligence University of the Basque Country UPV/EHU">(2);

Department of Systems Engineering and Automation University of the Basque Country UPV/EHU">(1);

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Nonlinear speech processing; Automatic speech recognition; Mel frequency cepstral coefficients; Fractal dimensions;

机译：非线性语音处理;自动语音识别;梅尔频率倒谱系数;分形维数;

相似文献

外文文献
中文文献
专利

1. Enhancing the Feature Extraction Process for Automatic Speech Recognition with Fractal Dimensions [J] . Aitzol Ezeiza, Karmele López de Ipiña, Carmen Hernández, Cognitive computation . 2013,第4期

机译：利用分形维数增强特征提取过程以实现自动语音识别
2. A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition [J] . Kentaro Ishizuka, Tomohiro Nakatani Speech Communication . 2006,第11期

机译：一种基于子带的周期性和非周期性分解与噪声鲁棒前端处理的特征提取方法，用于自动语音识别
3. Fractal dimensions of speech sounds: computation and application to automatic speech recognition. [J] . Maragos P, Potamianos A The Journal of the Acoustical Society of America . 1999,第3期

机译：语音的分形维数：自动语音识别的计算和应用。
4. Combining Mel Frequency Cepstral Coefficients and Fractal Dimensions for Automatic Speech Recognition [C] . Aitzol Ezeiza, Karmele Lopez de Ipina, Carmen Hernandez, Advances in nonlinear speech processing . 2011

机译：结合梅尔频率倒谱系数和分形维数以进行自动语音识别
5. An automatic speech recognition oriented study on segmentation, low dimensional feature extraction, and temporal trajectory information capture. [D] . Zhu, Yonggang. 2002

机译：面向语音识别的自动研究，涉及分割，低维特征提取和时间轨迹信息捕获。
6. On the Speech Properties and Feature Extraction Methods in Speech Emotion Recognition [O] . Juraj Kacur, Boris Puterka, Jarmila Pavlovicova, 2021

机译：语音情感识别中的语音特性和特征提取方法
7. Auditory-inspired morphological processing of speech spectrograms: applications in automatic speech recognition and speech enhancement [O] . Cadore Joyner, Valverde-Albacete Francisco J., Gallardo-Antolín Ascensión, 2012

机译：听觉启发的语音频谱图形态处理：自动语音识别和语音增强中的应用
8. Preprocessing and Feature Extraction for Automatic Recognition of Radar Images, [R] . chen, c. h. 1974

机译：雷达图像自动识别的预处理和特征提取，

Enhancing the Feature Extraction Process for Automatic Speech Recognition with Fractal Dimensions

摘要

著录项

相似文献

相关主题

期刊订阅