首页> 外文会议>International Workshop on Machine Learning for Multimodal Interaction >Posterior-Based Features and Distances in Template Matching for Speech Recognition
【24h】

Posterior-Based Features and Distances in Template Matching for Speech Recognition

机译:基于后的语音识别模板的特征和距离

获取原文

摘要

The use of large speech corpora in example-based approaches for speech recognition is mainly focused on increasing the number of examples. This strategy presents some difficulties because databases may not provide enough examples for some rare words. In this paper we present a different method to incorporate the information contained in such corpora in these example-based systems. A multilayer perceptron is trained on these databases to estimate speaker and task-independent phoneme posterior probabilities, which are used as speech features. By reducing the variability of features, fewer examples are needed to properly characterize a word. In this way, performance can be highly improved when limited number of examples is available. Moreover, we also study posterior-based local distances, these result more effective than traditional Euclidean distance. Experiments on Phonebook database support the idea that posterior features with a proper local distance can yield competitive results.
机译:在基于示例的语音识别方法中使用大型语音语料库主要集中在增加示例的数量。此策略提出了一些困难,因为数据库可能无法为某些稀有字提供足够的示例。在本文中,我们介绍了一种不同的方法,将这些基于示例性的系统中包含的信息纳入其中包含的信息。 Multilayer Perceptron在这些数据库中培训,以估计扬声器和任务无关的音素后续概率,其用作语音功能。通过降低特征的可变性,需要更少的示例来正确地表征单词。以这种方式,当有限数量的示例可用时,可以高度改善性能。此外,我们还研究了基于后的局部距离,这些结果比传统的欧几里德距离更有效。电话簿数据库的实验支持了具有适当局部距离的后部功能可以产生竞争结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号