Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech

机译：捷克语大词汇量连续语音识别中有监督和无监督说话人适应

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper deals with the problem of efficient speaker adaptation in large vocabulary continuous speech recognition (LVCSR) systems. The main goal is to adapt acoustic models of speech and to increase the recognition accuracy of these systems in tasks, where only one user is expected (e.g. voice dictation) or where the speaking person can be identified automatically (e.g. broadcast news transcription). For this purpose, we propose several modifications of the well known MLLR (Maximum Likelihood Linear Regression) method and we combine them with the MAP (Maximum A Posteriori) method. The results from a series of experiments show that the error rate of our 300K-word Czech recogniser can be reduced by about 9.9 % when only 30 seconds of supervised data are used for adaptation or by about 9.6 % when unsupervised adaptation on the same data is performed.

机译：本文探讨了大型词汇连续语音识别（LVCSR）系统中有效的说话人适应问题。主要目标是适应语音的声学模型并提高这些系统在任务中的预期任务中的识别准确性，这些任务只需要一个用户（例如语音听写），或者可以自动识别讲话者（例如广播新闻转录）。为此，我们提出了对众所周知的MLLR（最大似然线性回归）方法的几种修改，并将它们与MAP（最大后验线性）方法结合在一起。一系列实验的结果表明，当仅使用30秒的有监督数据进行自适应时，我们的30万字捷克识别器的错误率可降低9.9％；如果对相同数据进行无监督适应，则可将错误率降低约9.6％。执行。

著录项

来源
《International Conference on Text, Speech and Dialogue(TSD 2005); 20050912-15; Karlovy Vary(CZ)》|2005年|P.203-210|共8页
会议地点 Karlovy Vary(CZ)
作者
Petr Cerva; Jan Nouza;
展开▼
作者单位

SpeechLab, Technical University of Liberec Halkova 6, 461 17, Liberec 1, Czech Republic;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Aging speech recognition with speaker adaptation techniques: Study on medium vocabulary continuous Bengali speech [J] . Biswajit Das, Sandipan Mandal, Pabitra Mitra, Pattern recognition letters . 2013,第3期

机译：说话人适应技术对语音的老化识别：中词汇连续孟加拉语语音研究
2. Cooperative supervised and unsupervised learning algorithm for phoneme recognition in continuous speech and speaker-independent context [J] . Najet Arous, Noureddine Ellouze Neurocomputing . 2003,第Apr期

机译：连续语音和说话者无关上下文中的协作有监督和无监督学习算法用于音素识别
3. Comparison of Non-native Speaker Adaptations for Large Vocabulary Continuous Mandarin Speech Recognition [J] . Hong Wei, Jian Yang, Yuanyuan Pu Zhengpeng Zhao International Journal of Information Technology . 2005,第07期

机译：大词汇量连续汉语普通话语音识别的非母语说话人适应性比较
4. Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech [C] . Petr Cerva, Jan Nouza International Conference on Text, Speech and Dialogue . 2005

机译：捷克大词汇连续语音识别的监督和无监督的扬声器适应
5. Real-time speaker -independent large vocabulary continuous speech recognition. [D] . Li, Xiaolong. 2005

机译：实时独立于说话者的大词汇量连续语音识别。
6. Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition [O] . Sankaranarayanan Ananthakrishnan, Shrikanth Narayanan -1

机译：类别韵律模型的无监督适应用于韵律标记和语音识别
7. MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech [O] . Cerva P., Nouza J. 2004

机译：基于MAP的捷克语超大词汇语音识别中的说话人自适应

Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅