Speaker clustering and transformation for speaker adaptation in speech recognition systems

Padmanabhan M.; Bahl L.R.

Abstract

A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test speaker, and using only the data from these speakers (rather than the complete training corpus) to reestimate the system parameters. Further, a linear transformation is computed for every one of the selected training speakers to better map the training speaker's data to the test speaker's acoustic space. Finally, the system parameters (Gaussian means) are reestimated specifically for the test speaker using the transformed data from the selected training speakers. Experiments showed that this scheme is capable of providing an 18% relative improvement in the error rate on a large-vocabulary task with the use of as little as three sentences of adaptation data.
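
The three steps in the abstract (select acoustically close training speakers, map each one toward the test speaker with a linear transformation, then reestimate the Gaussian means from the transformed data) can be illustrated with a minimal Python sketch. The abstract does not say how closeness is measured or how the transformation is estimated, so everything specific here is an assumption made for illustration: closeness is the Euclidean distance between speaker mean vectors, the transformation is an affine least-squares fit between per-Gaussian means, and every frame is assumed to carry a hard Gaussian label. The function names are hypothetical and not taken from the paper.

```python
import numpy as np

def select_close_speakers(test_frames, train_frames_by_spk, k):
    """Return the k training speakers closest to the test speaker.
    Assumed metric: Euclidean distance between mean feature vectors."""
    test_mean = test_frames.mean(axis=0)
    dists = {spk: np.linalg.norm(frames.mean(axis=0) - test_mean)
             for spk, frames in train_frames_by_spk.items()}
    return sorted(dists, key=dists.get)[:k]

def class_means(frames, labels, n_gauss):
    """Per-Gaussian mean of the frames; Gaussians with no frames stay NaN."""
    means = np.full((n_gauss, frames.shape[1]), np.nan)
    for g in range(n_gauss):
        mask = labels == g
        if mask.any():
            means[g] = frames[mask].mean(axis=0)
    return means

def fit_affine(src, dst):
    """Least-squares affine map (A, b) such that src @ A + b approximates dst.
    Assumed estimator; with very few paired rows lstsq returns the
    minimum-norm solution."""
    src_aug = np.hstack([src, np.ones((len(src), 1))])
    weights, *_ = np.linalg.lstsq(src_aug, dst, rcond=None)
    return weights[:-1], weights[-1]

def adapt_means(si_means, test_frames, test_labels, train_data, k):
    """Reestimate Gaussian means for the test speaker.
    train_data maps speaker id -> (frames, integer labels)."""
    n_gauss = si_means.shape[0]
    frames_by_spk = {spk: frames for spk, (frames, _) in train_data.items()}
    selected = select_close_speakers(test_frames, frames_by_spk, k)
    test_means = class_means(test_frames, test_labels, n_gauss)

    sums = np.zeros_like(si_means, dtype=float)
    counts = np.zeros(n_gauss)
    for spk in selected:
        frames, labels = train_data[spk]
        spk_means = class_means(frames, labels, n_gauss)
        # Fit the map only on Gaussians observed for both speakers.
        shared = ~(np.isnan(test_means).any(axis=1) | np.isnan(spk_means).any(axis=1))
        A, b = fit_affine(spk_means[shared], test_means[shared])
        mapped = frames @ A + b  # training data moved toward the test speaker
        np.add.at(sums, labels, mapped)
        np.add.at(counts, labels, 1)

    # Gaussians never seen in the selected data keep their original means.
    new_means = si_means.astype(float).copy()
    seen = counts > 0
    new_means[seen] = sums[seen] / counts[seen][:, None]
    return new_means, selected
```

Calling `adapt_means(si_means, test_frames, test_labels, train_data, k)` returns the adapted means together with the list of selected speaker ids. A real system would more plausibly rank speakers by model likelihood and estimate the transformation under a maximum-likelihood criterion; the sketch only shows how the three stages fit together.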

Bibliographic details

Source: IEEE Transactions on Speech and Audio Processing | 1998, Issue 1 | pp. 71-77 | 7 pages
Authors: Padmanabhan M.; Bahl L.R.
Original format: PDF
Language: English
Chinese Library Classification: Electroacoustic technology and speech signal processing

Similar literature

1. Speaker clustering and transformation for speaker adaptation in speech recognition systems [J]. Padmanabhan M., Bahl L.R., Nahamoo D. IEEE Transactions on Speech and Audio Processing. 1998, Issue 1
2. Speech recognition using speaker adaptation by system parameter transformation [J]. Hao Y. IEEE Transactions on Speech and Audio Processing. 1994, Issue 1
3. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J]. Arata ITOH, Sunao HARA, Norihide KITAOKA. IEICE Transactions on Information and Systems. 2012, Issue 10
4. Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems [C]. Padmanabhan, M., Bahl, . 1996
5. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D]. Millington, Daniel S. 2011
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O]. Myungjong Kim, Younggwan Kim, Joohong Yoo, -1
7. Speaker Clustering And Transformation For Speaker Adaptation In Large-Vocabulary Speech Recognition Systems [O]. M. Padmanabhan, L. R. Bahl, D. Nahamoo. 1995
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment [R]. Hansen, J. H. 2015
