Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition

Vergin R.; O'Shaughnessy D.; Farhat A.


Abstract

The focus of a continuous speech recognition process is to match an input signal with a set of words or sentences according to some optimality criteria. The first step of this process is parameterization, whose major task is data reduction: converting the input signal into parameters while preserving virtually all of the speech signal information dealing with the text message. This contribution presents a detailed analysis of a widely used set of parameters, the mel frequency cepstral coefficients (MFCCs), and suggests a new parameterization approach taking into account the whole energy zone in the spectrum. Results obtained with the proposed new coefficients give a confidence interval about their use in a large-vocabulary speaker-independent continuous-speech recognition system.
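To make the parameterization step concrete, the following is a minimal sketch of the conventional MFCC computation for a single frame. It is not the generalized coefficients proposed in the paper, whose details the abstract does not give. The mel formula constants (2595, 700), the Hamming window, the filter count, the number of coefficients, and all function names are assumptions chosen for illustration.

```python
import numpy as np


def hz_to_mel(f_hz):
    # One common mel-scale formula (assumed here; other variants exist).
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters whose edges are equally spaced on the mel scale.
    edges_mel = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                            n_filters + 2)
    edges_hz = mel_to_hz(edges_mel)
    bin_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    bank = np.zeros((n_filters, bin_freqs.size))
    for i in range(n_filters):
        left, center, right = edges_hz[i], edges_hz[i + 1], edges_hz[i + 2]
        rising = (bin_freqs - left) / (center - left)
        falling = (right - bin_freqs) / (right - center)
        bank[i] = np.maximum(0.0, np.minimum(rising, falling))
    return bank


def mfcc_frame(frame, sample_rate, n_filters=24, n_coeffs=12):
    # Hypothetical helper: one frame of samples in, n_coeffs values out.
    frame = np.asarray(frame, dtype=float)
    n_fft = frame.size
    power = np.abs(np.fft.rfft(frame * np.hamming(n_fft))) ** 2
    energies = mel_filterbank(n_filters, n_fft, sample_rate) @ power
    log_energies = np.log(energies + 1e-10)  # small offset avoids log(0)
    # DCT-II of the log filterbank energies, keeping c1..c_n (c0 skipped).
    k = np.arange(1, n_coeffs + 1)[:, None]
    j = np.arange(n_filters)[None, :]
    basis = np.cos(np.pi * k * (j + 0.5) / n_filters)
    return basis @ log_energies


if __name__ == "__main__":
    sample_rate = 8000
    t = np.arange(256) / sample_rate
    frame = np.sin(2.0 * np.pi * 440.0 * t)  # synthetic test tone
    coeffs = mfcc_frame(frame, sample_rate)
    print(frame.size, "samples ->", coeffs.size, "coefficients")
```

In this sketch a 256-sample frame is reduced to 12 coefficients, which illustrates the data-reduction role the abstract assigns to parameterization.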


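Bibliographic details

Source
IEEE Transactions on Speech and Audio Processing | 1999, No. 5 | pp. 525-532 | 8 pages
Authors
Vergin R.; O'Shaughnessy D.; Farhat A.
Record information
Original format: PDF
Language: English
Chinese Library Classification: Electroacoustic technology and speech signal processing
Keywords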
cepstral analysis; parameter estimation; speech recognition; confidence interval; energy zone; generalized mel frequency cepstral coefficients; input signal; interpolation; large-vocabulary continuous-speech recognition; optimality criteria; parameters; sentences; s;


