Posterior-Based Features and Distances in Template Matching for Speech Recognition

Abstract

The use of large speech corpora in example-based approaches to speech recognition has mainly focused on increasing the number of examples. This strategy presents some difficulties because databases may not provide enough examples for some rare words. In this paper we present a different method to incorporate the information contained in such corpora into example-based systems. A multilayer perceptron is trained on these databases to estimate speaker- and task-independent phoneme posterior probabilities, which are used as speech features. By reducing the variability of the features, fewer examples are needed to properly characterize a word. In this way, performance can be improved substantially when only a limited number of examples is available. We also study posterior-based local distances, which prove more effective than the traditional Euclidean distance. Experiments on the Phonebook database support the idea that posterior features with a proper local distance can yield competitive results.
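
The idea of matching templates of phoneme posteriors under different local distances can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the KL direction (template against test), the `EPS` floor, the DTW step pattern, and the toy three-phoneme posterior vectors below are all assumptions made for illustration. The KL-divergence and Bhattacharyya distances are taken from the keywords of this record.

```python
import math

EPS = 1e-10  # assumed floor to avoid log(0); not taken from the paper


def kl_divergence(p, q):
    """KL(p || q) between two posterior vectors (direction is an assumption)."""
    return sum(pi * math.log(pi / max(qi, EPS)) for pi, qi in zip(p, q) if pi > 0)


def bhattacharyya(p, q):
    """Bhattacharyya distance: -log of the sum of sqrt(p_k * q_k)."""
    coefficient = sum(math.sqrt(pi * qi) for pi, qi in zip(p, q))
    return -math.log(max(coefficient, EPS))


def euclidean(p, q):
    """Traditional Euclidean distance, used here as the baseline."""
    return math.sqrt(sum((pi - qi) ** 2 for pi, qi in zip(p, q)))


def dtw_cost(template, test, local_distance):
    """Accumulated cost of the best DTW path with steps (1,0), (0,1), (1,1)."""
    n, m = len(template), len(test)
    inf = float("inf")
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = local_distance(template[i - 1], test[j - 1])
            acc[i][j] = d + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m]


def classify(test, templates, local_distance):
    """Return the label of the template with the lowest DTW cost."""
    best = min(templates, key=lambda item: dtw_cost(item[1], test, local_distance))
    return best[0]


# Hypothetical toy data: each frame is a posterior over three phoneme classes.
templates = [
    ("word_a", [[0.9, 0.05, 0.05], [0.1, 0.8, 0.1]]),
    ("word_b", [[0.05, 0.05, 0.9], [0.1, 0.8, 0.1]]),
]
test = [[0.8, 0.1, 0.1], [0.7, 0.2, 0.1], [0.2, 0.7, 0.1]]

for distance in (kl_divergence, bhattacharyya, euclidean):
    print(distance.__name__, classify(test, templates, distance))
```

Passing the local distance as a function keeps the alignment code unchanged while the frame-level measure is swapped, which is one simple way to compare posterior-based distances against the Euclidean baseline on the same templates.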

Bibliographic record

Source
International Workshop on Machine Learning for Multimodal Interaction | 2008 | 11 pages
Authors
Guillermo Aradilla; Herve Bourlard
Original format: PDF
Chinese Library Classification: TP3-53
Keywords
Speech Recognition; Template Matching; Posterior Features; KL-divergence; Bhattacharyya; Multi-Layer Perceptron

Similar literature

1. A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition [J]. Wooil Kim, J.H.L. Hansen. IEEE Transactions on Audio, Speech, and Language Processing. 2011, No. 5
2. Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition [J]. Yanqing Sun, Yu Zhou, Qingwei Zhao. IEICE Transactions on Information and Systems. 2010, No. 9
3. Posterior-Based Features and Distances in Template Matching for Speech Recognition [C]. Guillermo Aradilla, Herve Bourlard. International Workshop on Machine Learning for Multimodal Interaction; MLMI 2008. 2008
4. Integrate template matching and statistical modeling for continuous speech recognition [D]. Sun, Xie. 2011
5. Improving protein fold recognition and template-based modeling by employing probabilistic-based matching between predicted one-dimensional structural properties of query and corresponding native properties of templates [O]. Yuedong Yang, Eshel Faraggi, Huiying Zhao
6. Posterior-Based Features and Distances in Template Matching for Speech Recognition [O]. Guillermo Aradilla, Hervé Bourlard. 2007
