首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Discriminative Training for Hierarchical Clustering in Speaker Diarization
【24h】

Discriminative Training for Hierarchical Clustering in Speaker Diarization

机译:说话人差异化中层次聚类的判别训练

获取原文

摘要

In this paper, we propose a discriminative extension to agglom-erative hierarchical clustering, a typical technique for speaker diarization, that fits seamlessly with most state-of-the art diarization algorithms. We propose to use maximum mutual information using bootstrapping i.e., initial predictions are used as input for retraining of models in an unsupervised fashion. This article describes this new approach, analyzes its behavior, and presents results on the official NIST Rich Transcription datasets. We show an absolute improvement of 4 % DER with respect to the generative approach baseline. We also observe a strong correlation between the original error and the amount of improvement, that is, the better our predicted labels are, the more gain we obtain from discriminative training, which we interpret as a strong indication for the high potential of the extension.
机译:在本文中,我们提出了对聚类层次聚类的判别性扩展,聚类层次聚类是一种典型的说话人歧化技术,可与大多数最新的歧化算法无缝配合。我们建议使用自举法使用最大的互信息,即以无监督的方式将初始预测用作模型再训练的输入。本文介绍了这种新方法,分析了它的行为,并在NIST丰富转录官方数据集上给出了结果。我们显示相对于生成方法基准,绝对改善了4%的DER。我们还观察到原始错误与改进量之间存在很强的相关性,也就是说,我们的预测标签越好,我们从判别训练中获得的收益就越大,我们将其解释为扩展潜力巨大的有力迹象。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号