首页> 外文会议>International Conference on Text,Speech and Dialogue(TSD 2004); 20040908-11; Brno(CZ) >A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognition
【24h】

A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognition

机译:连续语音识别中说话人快速适应的说话人聚类算法

获取原文
获取原文并翻译 | 示例

摘要

In this paper a speaker adaptation methodology is proposed, which first automatically determines a number of speaker clusters in the training material, then estimates the parameters of the corresponding models, and finally applies a fast match strategy - based on the so called histogram models - to choose the optimal cluster for each test utterance. The fast match strategy is critical to make this methodology useful in real applications, since carrying out several recognition passes - one for each cluster of speakers -, and then selecting the decoded string with the highest likelihood, would be too costly. Preliminary experimentation over two speech databases in Spanish reveal that both the clustering algorithm and the fast match strategy are consistent and reliable. The histogram models, though being suboptimal - they succeeded in guessing the right cluster for unseen test speakers in 85% of the cases with read speech, and in 63% of the cases with spontaneous speech - , yielded around a 6% decrease in error rate in phonetic recognition experiments.
机译:本文提出了一种说话人适应方法,该方法首先自动确定培训材料中的说话人群体,然后估计相应模型的参数,最后基于所谓的直方图模型将快速匹配策略应用于为每种测试发音选择最佳的聚类。快速匹配策略对于使此方法在实际应用中有用至关重要,因为执行多次识别遍历(针对每个说话者群集进行一次),然后选择可能性最高的解码字符串会非常昂贵。对两个西班牙语语音数据库的初步实验表明,聚类算法和快速匹配策略都是一致且可靠的。直方图模型虽然不理想-他们成功地为85%的阅读语音案例和63%的自发语音案例猜测了看不见的测试说话者的正确聚类-产生了大约6%的错误率降低在语音识别实验中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号