...
首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Generalized mel frequency cepstral coefficients forlarge-vocabulary speaker-independent continuous-speech recognition
【24h】

Generalized mel frequency cepstral coefficients forlarge-vocabulary speaker-independent continuous-speech recognition

机译:广义梅尔频率倒谱系数用于大词汇量独立于说话人的连续语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

The focus of a continuous speech recognition process is to match an input signal with a set of words or sentences according to some optimality criteria. The first step of this process is parameterization, whose major task is data reduction by converting the input signal into parameters while preserving virtually all of the speech signal information dealing with the text message. This contribution presents a detailed analysis of a widely used set of parameters, the mel frequency cepstral coefficients (MFCCs), and suggests a new parameterization approach taking into account the whole energy zone in the spectrum. Results obtained with the proposed new coefficients give a confidence interval about their use in a large-vocabulary speaker-independent continuous-speech recognition system
机译:连续语音识别过程的重点是根据一些最佳标准将输入信号与一组单词或句子匹配。此过程的第一步是参数化,其主要任务是通过将输入信号转换为参数同时保留几乎所有与文本消息有关的语音信号信息来减少数据量。这一贡献提出了对广泛使用的参数集,梅尔频率倒谱系数(MFCC)的详细分析,并提出了一种考虑到频谱中整个能量区域的新参数化方法。利用拟议的新系数获得的结果给出了在大型词汇独立于说话人的连续语音识别系统中使用它们的置信区间

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号