A Posteriori and a Priori Transformations for Speaker Adaptation in Large Vocabulary Speech Recognition Systems

机译：大型词汇语音识别系统中扬声器适应的后验和先验变换

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The speaker-dependent HMM-based recognizers gives lower word error rates in comparison with the corresponding speaker-independent recognizers. The aim of speaker adaptation techniques is to enhance the speaker-independent acoustic models to bring their recognition accuracy as close as possible to the one obtained with speaker-dependent models. In this paper, we propose a method using test and training data for acoustic model adaptation. This method operates in two steps. The first one performs an a priori adaptation using the transcribed training data of the closest training speakers to the test speaker. This adaptation is done with MAP procedure allowing reduced variances in the acoustic models. The second one performs an a posteriori adaptation using the MLLR procedure on the test data, allowing mapping of Gaussians means to match the test speaker's acoustic space. This adaptation strategy was evaluated in a large vocabulary speech recognition task. Our method leads to a relative gain of 15% with respect to the baseline system and 10% with respect to the conventional MLLR adaptation.

机译：与相应的扬声器 - 独立识别器相比，基于扬声器的基于HMM的识别器提供了较低的错误率。扬声器适配技术的目的是增强扬声器 - 独立的声学模型，以使其识别精度尽可能靠近用扬声器依赖模型获得。在本文中，我们提出了一种使用测试和培训数据进行声学模型适应的方法。该方法以两个步骤操作。第一个使用最近训练扬声器的转录训练数据来执行先验的适应性。使用MAP过程完成这种适配，允许声学模型中的差异降低。第二个使用测试数据上的MLLR过程执行后验，允许高斯映射意味着匹配测试扬声器的声学空间。这种适应策略在大型词汇表识别任务中进行了评估。我们的方法相对于基线系统的相对增益为15％，相对于传统的MLL适应而导致10％。

著录项

来源
《European conference on speech communication and technology》|2001年||共4页
会议地点
作者
Driss Matrouf; Olivier Bellot; Pascal Nocera; Georges Linares; Jean-Francois Bonastre;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类传播理论;
关键词

相似文献

外文文献
中文文献
专利

1. Speaker clustering and transformation for speaker adaptation in speech recognition systems [J] . Padmanabhan M., Bahl L.R. IEEE Transactions on Speech and Audio Proceeding . 1998,第1期

机译：语音识别系统中的说话人适应和说话人聚类和转换
2. Aging speech recognition with speaker adaptation techniques: Study on medium vocabulary continuous Bengali speech [J] . Biswajit Das, Sandipan Mandal, Pabitra Mitra, Pattern recognition letters . 2013,第3期

机译：说话人适应技术对语音的老化识别：中词汇连续孟加拉语语音研究
3. Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation [J] . Huang Zhen, Siniscalchi Sabato Marco, Lee Chin-Hui Pattern recognition letters . 2017,第octa15期

机译：基于深度神经网络的语音识别和说话人自适应的插件最大后验解码器的分层贝叶斯组合
4. A Posteriori and a Priori Transformations for Speaker Adaptation in Large Vocabulary Speech Recognition Systems [C] . Driss Matrouf, Olivier Bellot, Pascal Nocera, European conference on speech communication and technology . 2001

机译：大型词汇语音识别系统中扬声器适应的后验和先验变换
5. Discriminative training for speaker adaptation and minimum Bayes risk estimation in large vocabulary speech recognition. [D] . Doumpiotis, Vlasios. 2005

机译：大词汇量语音识别中的说话人适应性和最低贝叶斯风险估计的判别训练。
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O] . Myungjong Kim, Younggwan Kim, Joohong Yoo, -1

机译：KL-HMM的正则化说话人适应用于音调异常语音识别
7. Speaker Clustering And Transformation For Speaker Adaptation In Large-Vocabulary Speech Recognition Systems [O] . M. Padmanabhan, L. R. Bahl, D. Nahamoo, 1995

机译：大词汇量语音识别系统中说话人聚类和说话人适应的转换

A Posteriori and a Priori Transformations for Speaker Adaptation in Large Vocabulary Speech Recognition Systems

摘要

著录项

相似文献

相关主题

期刊订阅