A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognition


Abstract

In this paper a speaker adaptation methodology is proposed, which first automatically determines a number of speaker clusters in the training material, then estimates the parameters of the corresponding models, and finally applies a fast match strategy, based on the so-called histogram models, to choose the optimal cluster for each test utterance. The fast match strategy is critical to making this methodology useful in real applications, since carrying out several recognition passes (one for each cluster of speakers) and then selecting the decoded string with the highest likelihood would be too costly. Preliminary experimentation over two speech databases in Spanish reveals that both the clustering algorithm and the fast match strategy are consistent and reliable. The histogram models, though suboptimal (they succeeded in guessing the right cluster for unseen test speakers in 85% of the cases with read speech, and in 63% of the cases with spontaneous speech), yielded around a 6% decrease in error rate in phonetic recognition experiments.
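The abstract does not define the histogram models, so the following is only an illustrative sketch of the fast match idea under a stated assumption: each cluster is represented by a smoothed discrete distribution over vector-quantized frame labels, and the cluster whose distribution gives the test utterance the highest log-likelihood is chosen. The function names, the add-one smoothing, and the toy data are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def train_histogram(label_sequences, codebook_size, smoothing=1.0):
    """Estimate a smoothed distribution over codebook labels for one cluster.

    Assumption: a "histogram model" is a discrete distribution over
    vector-quantized frame labels; the abstract does not specify this.
    """
    counts = Counter(label for seq in label_sequences for label in seq)
    total = sum(counts.values()) + smoothing * codebook_size
    return [(counts.get(k, 0) + smoothing) / total for k in range(codebook_size)]


def log_likelihood(histogram, labels):
    """Log-likelihood of a label sequence, treating frames as independent."""
    return sum(math.log(histogram[label]) for label in labels)


def choose_cluster(histograms, labels):
    """Return the name of the cluster whose histogram best explains the labels."""
    return max(histograms, key=lambda name: log_likelihood(histograms[name], labels))


# Toy data (hypothetical): two clusters and a codebook of four labels.
training = {
    "cluster_a": [[0, 0, 1, 0], [0, 1, 0, 0]],
    "cluster_b": [[2, 3, 3, 2], [3, 3, 2, 3]],
}
histograms = {
    name: train_histogram(seqs, codebook_size=4) for name, seqs in training.items()
}
chosen = choose_cluster(histograms, [3, 2, 3, 3])
```

The point of such a scheme is cost: scoring an utterance against a handful of histograms is a single pass over its frame labels, whereas the alternative described in the abstract needs one full recognition pass per cluster.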


Bibliographic record

Source
International Conference on Text, Speech and Dialogue (TSD 2004); 8-11 September 2004; Brno (CZ) | 2004 | pp. 433-440 | 8 pages
Conference location: Brno (CZ)
Authors
Luis Javier Rodriguez; M. Ines Torres
Author affiliation

Pattern Recognition Speech Technology Group, DEE, Facultad de Ciencia y Tecnologia, Universidad del Pais Vasco, Apartado 644, 48080 Bilbao, Spain

Original format: PDF
Language of text: English
Chinese Library Classification: Artificial intelligence theory

