首页> 外文会议>Advances in nonlinear speech processing >Combining Mel Frequency Cepstral Coefficients and Fractal Dimensions for Automatic Speech Recognition

【24h】

Combining Mel Frequency Cepstral Coefficients and Fractal Dimensions for Automatic Speech Recognition

机译：结合梅尔频率倒谱系数和分形维数以进行自动语音识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Hidden Markov Models and Mel Frequency Cepstral Coefficients (MFCC's) are a sort of standard for Automatic Speech Recognition (ASR) systems, but they fail to capture the nonlinear dynamics of speech that are present in the speech waveforms. The extra information provided by the nonlinear features could be especially useful when training data is scarce, or when the ASR task is very complex. In this work, the Fractal Dimension (FD) of the observed time series is combined with the traditional MFCC's in the feature vector in order to enhance the performance of two different ASR systems: the first one is a very simple one, with very few training examples, and the second one is a Large Vocabulary Continuous Speech Recognition System for Broadcast News.

机译：隐马尔可夫模型和梅尔频率倒谱系数（MFCC）是自动语音识别（ASR）系统的一种标准，但是它们无法捕获语音波形中存在的非线性语音动态。当训练数据稀少或ASR任务非常复杂时，非线性功能提供的额外信息可能特别有用。在这项工作中，将观测到的时间序列的分形维数（FD）与特征向量中的传统MFCC相结合，以增强两种不同的ASR系统的性能：第一个是非常简单的系统，只需很少的训练例子，第二个是广播新闻的大词汇量连续语音识别系统。

著录项

来源
《Advances in nonlinear speech processing》|2011年|p.183-189|共7页
会议地点 Las Palmas de Gran Canaria(ES);Las Palmas de Gran Canaria(ES)
作者
Aitzol Ezeiza; Karmele Lopez de Ipina; Carmen Hernandez; Nora Barroso;
展开▼
作者单位

Department of System Engineering and Automation, University of the Basque Country, Spain;

Department of System Engineering and Automation, University of the Basque Country, Spain;

Department of System Engineering and Automation, University of the Basque Country, Spain;

Irunweb Enterprise, Auzolan 2B - 2, Irun, Spain;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词
nonlinear speech processing; automatic speech recognition; mel frequency cepstral coefficients; fractal dimensions;

机译：非线性语音处理；自动语音识别；梅尔频率倒谱系数；分形维数;

相似文献

外文文献
中文文献
专利

1. Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures [J] . Darch J, Milner B, Vaseghi S The Journal of the Acoustical Society of America . 2008,第6期

机译：分布式语音识别架构中基于mel-频率倒谱系数的声学语音特征分析和预测
2. Higher Order Mel-Frequency Cepstral and Autoregressive Reflection Coefficients in Recognizing Three Dimensions of Speech Emotions [J] . A. Milton, S. Tamil Selvi International Journal of Electronics Engineering Research . 2015,第2期

机译：识别语音情感的三个维度的高阶梅尔频率倒谱和自回归反射系数
3. Fusion of mel and gammatone frequency cepstral coefficients for speech emotion recognition using deep C-RNN [J] . Kumaran U., Radha Rammohan S., Nagarajan Senthil Murugan, International journal of speech technology . 2021,第2期

机译：MEL和γ和γ和γ频率倒谱系数使用深C-RNN的语音情感识别
4. Combining Mel Frequency Cepstral Coefficients and Fractal Dimensions for Automatic Speech Recognition [C] . Aitzol Ezeiza, Karmele Lopez de Ipina, Carmen Hernandez, International Conference on Advances in Nonlinear Speech Processing . 2011

机译：组合MEL频率薄膜系数和分形尺寸进行自动语音识别
5. Development of a speech recognition system using the Mel Frequency Cepstrum Coefficient method. [D] . Mahajan, Mayur. 2016

机译：使用梅尔频率倒谱系数方法开发语音识别系统。
6. Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features [O] . Ömer Eskidere, Ahmet Gürhanlı 2015

机译：基于多锥梅尔频率倒谱系数特征的语音障碍分类
7. Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures [O] . Darch, Jonathan, Milner, Ben, Vaseghi, Saeed 2008

机译：分布式语音识别架构中基于mel频率倒谱系数的语音特征分析和预测

Combining Mel Frequency Cepstral Coefficients and Fractal Dimensions for Automatic Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅