首页> 外文会议>International Conference on Text,Speech and Dialogue >A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognition
【24h】

A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognition

机译:用于连续语音识别的快速扬声器适应的扬声器聚类算法

获取原文

摘要

In this paper a speaker adaptation methodology is proposed, which first automatically determines a number of speaker clusters in the training material, then estimates the parameters of the corresponding models, and finally applies a fast match strategy-based on the so called histogram models-to choose the optimal cluster for each test utterance. The fast match strategy is critical to make this methodology useful in real applications, since carrying out several recognition passes-one for each cluster of speakers-, and then selecting the decoded string with the highest likelihood, would be too costly. Preliminary experimentation over two speech databases in Spanish reveal that both the clustering algorithm and the fast match strategy are consistent and reliable. The histogram models, though being suboptimal-they succeeded in guessing the right cluster for unseen test speakers in 85% of the cases with read speech, and in 63% of the cases with spontaneous speech-, yielded around a 6% decrease in error rate in phonetic recognition experiments.
机译:在本文中,提出了一种扬声器适配方法,该方法首先自动确定训练材料中的许多扬声器群集,然后估计相应模型的参数,最后应用了基于所谓的直方图模型的快速匹配策略 - 到为每个测试话语选择最佳群集。快速匹配策略对于使该方法具有重要应用的方法至关重要,因为对每个扬声器进行几个识别传递 - 一个识别器 - 然后选择具有最高可能性的解码字符串,这将是太昂贵的。两种语音数据库中的初步实验揭示了聚类算法和快速匹配策略都是一致可靠的。直方图模型虽然是次优 - 他们成功地猜测了85%的读语言的85%案例中的正确集群,并且在63%的自发演讲中的情况下,误差率下降约6%在语音识别实验中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号