首页> 外文会议>European Conference on Speech Communication and Technology >Extraction Methods of Voicing Feature for Robust Speech Recognition
【24h】

Extraction Methods of Voicing Feature for Robust Speech Recognition

机译:强大的语音识别中发声功能的提取方法

获取原文
获取外文期刊封面目录资料

摘要

In this paper, three different voicing features are studied as additional acoustic features for continuous speech recognition. The harmonic product spectrum based feature is extracted in frequency domain while the autocorrelation and the average magnitude difference based methods work in time domain. The algorithms produce a measure of voicing for each time frame. The voicing measure was combined with the standard Mel Frequency Cepstral Coefficients (MFCC) using linear discriminant analysis to choose the most relevant features. Experiments have been performed on small and large vocabulary tasks. The three different voicing measures combined with MFCCs resulted in similar improvements in word error rate: improvements of up to 14% on the small-vocabulary task and improvements of up to 6% on the large-vocabulary task relative to using MFCC alone with the same overall number of parameters in the system.
机译:在本文中,研究了三种不同的发声特征,作为额外的声学特征,用于连续语音识别。在频域中提取谐波产品频谱的特征,而自相关和基于平均幅度差异的方法在时域中工作。该算法产生每次帧的发声量度。使用线性判别分析与标准MEL频率谱系数(MFCC)与标准MEL频率谱系数(MFCC)相结合,以选择最相关的功能。已经对小型和大词汇组织进行了实验。与MFCC相结合的三种不同的发声措施导致了单词错误率类似的改进:在小词汇任务上的改进高达14%,而且在大词汇表中相对于使用MFCC的大号任务的提高最多可提高6%系统中的总数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号