首页> 外文会议>IEEE international conference on acoustics, speech, and signal processing >Utterance normalization using vowel features in a spoken word recognition system for multiple speakers
【24h】

Utterance normalization using vowel features in a spoken word recognition system for multiple speakers

机译:用于多个扬声器的口语识别系统中的元音特征的话语归一化

获取原文

摘要

The authors propose a novel method of normalization based on linear transformation of acoustic features of input speech using only one isolated utterance each of the five vowels of Japanese by each individual speaker. Experiments on isolated word recognition combining the proposed normalization method and multiple-template DP matching showed a marked improvement in the recognition rate, especially for smaller numbers of templates per word. The proposed method gives consistently higher word recognition scores than the four-dimensional representation on the Karhunen-Loeve transformation, and also gives higher scores than the original 16-dimensional representation of filter-bank outputs, especially when the number of templates is small. Together with the fact that this method reduces the dimension of the feature vector by a factor of four, the results demonstrate the validity of the proposed method.
机译:作者提出了一种基于输入演讲的声学特征的线性变换,仅使用每个单独的扬声器的五个元音的一个分离的话语来提出了一种基于输入语音的线性变换的新颖性。孤立字识别的实验结合所提出的归一化方法和多模板DP匹配的识别率显着改善,特别是对于每个单词的较少数量的模板。该方法提供比Karhunen-Loeve变换的四维表示始终如一的单词识别分数,并且还提供比滤波器存储体输出的原始16维表示更高的分数,尤其是当模板的数量小时。与该方法减少了特征向量的尺寸的事实,结果证明了所提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号