Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems

机译：大词汇表语音识别系统中扬声器适应的扬声器聚类和转换

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test speaker, and using only the data from these speakers (rather than the complete training corpus) to re-estimate the system parameters. Further, a linear transformation is computed for every one of the selected training speakers to better map the training speaker's data to the test speaker's acoustic space. Finally, the system parameters (Gaussian means) are re-estimated specifically for the test speaker using the transformed data from the selected training speakers. Experiments showed that this scheme is capable of reducing the error rate by 10-15% with the use of as little as 3 sentences of adaptation data.

机译：描述了一种扬声器适应策略，其基于查找讲话者的子集，来自训练集，他们在声学靠近测试扬声器，并仅使用来自这些扬声器（而不是完整的培训语料库）来重新估计的数据系统参数。此外，为每个选定的训练扬声器计算线性变换，以更好地将训练扬声器的数据映射到测试扬声器的声学空间。最后，使用来自所选训练扬声器的变换数据专门针对测试扬声器重新估计系统参数（高斯手段）。实验表明，该方案能够将误差率降低10-15％，使用只需的适应数据的3句。

著录项

来源
《IEEE International Conference on Acoustics, Speech, and Signal Processing》|1996年||共4页
会议地点
作者
Padmanabhan M.; Bahl L.R.; Institute of Electric and Electronic Engineer;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Speaker clustering and transformation for speaker adaptation in speech recognition systems [J] . Padmanabhan M., Bahl L.R. IEEE Transactions on Speech and Audio Proceeding . 1998,第1期

机译：语音识别系统中的说话人适应和说话人聚类和转换
2. Speaker clustering and transformation for speaker adaptation inspeech recognition systems [J] . Padmanabhan M., Bahl L.R., Nahamoo D., IEEE Transactions on Speech and Audio Proceessing . 1998,第1期

机译：语音识别系统中的说话人适应和说话人聚类和转换
3. Speech recognition using speaker adaptation by system parameter transformation [J] . Hao Y. IEEE Transactions on Speech and Audio Proceeding . 1994,第1期

机译：通过系统参数转换使用说话人自适应进行语音识别
4. Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems [C] . Padmanabhan, M., Bahl, . 1996

机译：大词汇量语音识别系统中说话人的聚类和转换，以适应说话人
5. Large-vocabulary speaker-independent continuous speech recognition: The SPHINX system. [D] . Lee, Kai-Fu. 1988

机译：独立于大词汇的说话者的连续语音识别：SPHINX系统。
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O] . Myungjong Kim, Younggwan Kim, Joohong Yoo, -1

机译：KL-HMM的正则化说话人适应用于音调异常语音识别
7. Speaker Clustering And Transformation For Speaker Adaptation In Large-Vocabulary Speech Recognition Systems [O] . M. Padmanabhan, L. R. Bahl, D. Nahamoo, 1995

机译：大词汇量语音识别系统中说话人聚类和说话人适应的转换
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems

摘要

著录项

相似文献

相关主题

期刊订阅