A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognition

机译：用于连续语音识别的快速扬声器适应的扬声器聚类算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper a speaker adaptation methodology is proposed, which first automatically determines a number of speaker clusters in the training material, then estimates the parameters of the corresponding models, and finally applies a fast match strategy-based on the so called histogram models-to choose the optimal cluster for each test utterance. The fast match strategy is critical to make this methodology useful in real applications, since carrying out several recognition passes-one for each cluster of speakers-, and then selecting the decoded string with the highest likelihood, would be too costly. Preliminary experimentation over two speech databases in Spanish reveal that both the clustering algorithm and the fast match strategy are consistent and reliable. The histogram models, though being suboptimal-they succeeded in guessing the right cluster for unseen test speakers in 85% of the cases with read speech, and in 63% of the cases with spontaneous speech-, yielded around a 6% decrease in error rate in phonetic recognition experiments.

机译：在本文中，提出了一种扬声器适配方法，该方法首先自动确定训练材料中的许多扬声器群集，然后估计相应模型的参数，最后应用了基于所谓的直方图模型的快速匹配策略 - 到为每个测试话语选择最佳群集。快速匹配策略对于使该方法具有重要应用的方法至关重要，因为对每个扬声器进行几个识别传递 - 一个识别器 - 然后选择具有最高可能性的解码字符串，这将是太昂贵的。两种语音数据库中的初步实验揭示了聚类算法和快速匹配策略都是一致可靠的。直方图模型虽然是次优 - 他们成功地猜测了85％的读语言的85％案例中的正确集群，并且在63％的自发演讲中的情况下，误差率下降约6％在语音识别实验中。

著录项

来源
《International Conference on Text,Speech and Dialogue》|2004年||共8页
会议地点
作者
Luis Javier Rodriguez; M. Ines Torres;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. An acoustic-phonetic-based speaker adaptation technique forimproving speaker-independent continuous speech recognition [J] . Yunxin Zhao IEEE Transactions on Speech and Audio Proceessing . 1994,第3期

机译：基于声学的说话人自适应技术，用于改善与说话人无关的连续语音识别
2. An acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition [J] . Yunxin Zhao IEEE Transactions on Speech and Audio Proceeding . 1994,第3期

机译：基于声学的说话人自适应技术，用于改善与说话人无关的连续语音识别
3. Speaker clustering and transformation for speaker adaptation in speech recognition systems [J] . Padmanabhan M., Bahl L.R. IEEE Transactions on Speech and Audio Proceeding . 1998,第1期

机译：语音识别系统中的说话人适应和说话人聚类和转换
4. A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognition [C] . Luis Javier Rodriguez, M. Ines Torres International Conference on Text,Speech and Dialogue(TSD 2004); 20040908-11; Brno(CZ) . 2004

机译：连续语音识别中说话人快速适应的说话人聚类算法
5. Real-time speaker -independent large vocabulary continuous speech recognition. [D] . Li, Xiaolong. 2005

机译：实时独立于说话者的大词汇量连续语音识别。
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O] . Myungjong Kim, Younggwan Kim, Joohong Yoo, -1

机译：KL-HMM的正则化说话人适应用于音调异常语音识别
7. Speaker Adaptation By Modeling The Speaker Variation In A Continuous Speech Recognition System [O] . Nikko Ström 2007

机译：通过建模连续语音识别系统中的说话人变异来调整说话人

A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅