Bayesian adaptation of speech recognizers to field speech data

Abstract

This article studies a Bayesian, or maximum a posteriori (MAP), approach to adapting continuous density hidden Markov models (CDHMMs) to a specific condition of a speech recognition application. To improve model robustness, CDHMMs previously trained on laboratory data are adapted using context-dependent field utterances. The MAP approach raises two specific problems: estimating the parameters of the a priori distribution, and the lack of field adaptation data for some distributions of the CDHMM. Estimating the a priori distribution parameters requires identifying different realizations of the model parameters; three solutions are proposed and evaluated. To overcome the lack of adaptation data, field acoustic training frames may be shared among similar distributions, using an acoustic tree obtained by progressively clustering the model distributions. Recognition results show that MAP-adapted models significantly outperform models trained by maximum likelihood (ML), especially when the field data set is small.
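
The abstract does not give the update formulas. As a minimal sketch, assuming a single Gaussian with a conjugate Gaussian prior on its mean (the standard textbook form of MAP mean adaptation, not necessarily the estimator used in the paper), the adapted mean interpolates between the laboratory-trained mean and the average of the field frames. The function names `map_adapt_mean` and `ml_mean` and the prior weight `tau` are hypothetical and chosen only for illustration.

```python
def map_adapt_mean(prior_mean, frames, tau):
    """MAP estimate of a Gaussian mean under a conjugate Gaussian prior.

    prior_mean: mean of the laboratory-trained Gaussian (list of floats)
    frames:     field adaptation vectors assigned to this Gaussian
    tau:        prior weight (assumed hyperparameter, must be > 0);
                larger values keep the estimate closer to prior_mean
    """
    n = len(frames)
    dim = len(prior_mean)
    # Accumulate the per-dimension sum of the field frames.
    sums = [0.0] * dim
    for frame in frames:
        for d in range(dim):
            sums[d] += frame[d]
    # Weighted combination of the prior mean and the field-data sum.
    return [(tau * prior_mean[d] + sums[d]) / (tau + n) for d in range(dim)]


def ml_mean(frames):
    """Maximum likelihood estimate: the plain average of the field frames."""
    n = len(frames)
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / n for d in range(dim)]


lab_mean = [0.0, 0.0]
field_frames = [[1.0, 2.0], [3.0, 2.0]]

adapted = map_adapt_mean(lab_mean, field_frames, tau=2.0)
ml_only = ml_mean(field_frames)
no_data = map_adapt_mean(lab_mean, [], tau=2.0)
```

In this toy case the two field frames and `tau = 2.0` carry equal weight, so `adapted` lies halfway between the laboratory mean and the ML estimate. With no field frames at all, the MAP estimate falls back to the laboratory mean, whereas the ML estimate is undefined, which is consistent with the abstract's observation that MAP helps most when field data is scarce.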


Bibliographic details

Source
1996, pp. 917-920 (4 pages)
Authors
Miglietta, C.G.; Mokbel, C.
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology

Similar literature


1. Cluster-based dynamic variance adaptation for interconnecting speech enhancement pre-processor and speech recognizer [J]. Marc Delcroix, Shinji Watanabe, Tomohiro Nakatani. Computer Speech and Language, 2013, no. 1
2. "A speech recognizer" a tool to recognize the high clarity speech signal based on existing speech using ISCA [J]. Velammal M. Navaneetha, Kumar P. Nirmal. Analog Integrated Circuits and Signal Processing, 2019, no. 1
3. Recognizing emotional speech in Persian: A validated database of Persian emotional speech (Persian ESD) [J]. Niloofar Keshtiari, Michael Kuhlmann, Moharram Eslami. Behavior Research Methods, 2015, no. 1
4. Bayesian adaptation of speech recognizers to field speech data [C]. Miglietta C.G., Mokbel C. Institute of Electric and Electronic Engineer International Conference on Spoken Language, 1996
5. Mobile GIS as if field users mattered: Small is ubiquitous but can speech be recognized? [D]. Hunter, Andrew James Simpson. 2003
6. Reanalyzing neurocognitive data on the role of the motor system in speech perception within COSMO, a Bayesian perceptuo-motor model of speech communication [O]. Marie-Lou Barnaud, Pierre Bessière, Julien Diard
7. Bayesian Adaptation of Speech Recognizers to Field Speech Data [O]. Carmelo Giammarco Miglietta, Chafic Mokbel, Denis Jouvet. 1996
8. Mixture Input Transformations for Adaptation of Hybrid Connectionist Speech Recognizers [R]. Abrash, V. 1997
