首页> 外文会议> >Utterance normalization using vowel features in a spoken word recognition system for multiple speakers
【24h】

Utterance normalization using vowel features in a spoken word recognition system for multiple speakers

机译:在多个说话者的语音识别系统中使用元音功能进行话语归一化

获取原文

摘要

The authors propose a novel method of normalization based on linear transformation of acoustic features of input speech using only one isolated utterance each of the five vowels of Japanese by each individual speaker. Experiments on isolated word recognition combining the proposed normalization method and multiple-template DP matching showed a marked improvement in the recognition rate, especially for smaller numbers of templates per word. The proposed method gives consistently higher word recognition scores than the four-dimensional representation on the Karhunen-Loeve transformation, and also gives higher scores than the original 16-dimensional representation of filter-bank outputs, especially when the number of templates is small. Together with the fact that this method reduces the dimension of the feature vector by a factor of four, the results demonstrate the validity of the proposed method.
机译:作者提出了一种新的归一化方法,该方法基于输入语音的声学特征的线性变换,每个说话者仅使用五个日语元音中的每个独立发音即可。将提出的归一化方法与多模板DP匹配相结合的孤立单词识别实验表明,识别率有了显着提高,尤其是对于每个单词的模板数量较少的情况。与Karhunen-Loeve变换上的四维表示相比,该方法给出的单词识别分数始终较高,并且与原始16维表示的滤波器组输出相比,得分更高,尤其是在模板数量较少时。结合该方法将特征向量的维数减小四倍的事实,结果证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号