Discriminative Training for Hierarchical Clustering in Speaker Diarization


Abstract

In this paper, we propose a discriminative extension to agglomerative hierarchical clustering, a typical technique for speaker diarization, that fits seamlessly with most state-of-the-art diarization algorithms. We propose training with the maximum mutual information (MMI) criterion via bootstrapping, i.e., initial predictions are used as input for retraining the models in an unsupervised fashion. This article describes the new approach, analyzes its behavior, and presents results on the official NIST Rich Transcription datasets. We show an absolute improvement of 4% in diarization error rate (DER) with respect to the generative baseline. We also observe a strong correlation between the original error and the amount of improvement: the better our predicted labels are, the more we gain from discriminative training, which we interpret as a strong indication of the extension's high potential.
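The bootstrapping idea in the abstract (initial predictions reused as training targets) can be illustrated with a small toy sketch. This is not the authors' system: the abstract does not specify the models or the training procedure, so the sketch swaps in a plain softmax classifier trained on the conditional log-likelihood of the labels, a posterior-based objective in the same spirit as MMI. The function names (`train_discriminative`, `bootstrap_relabel`), the synthetic 2-D "speaker" data, and the 20% label-noise rate are all assumptions made for illustration.

```python
import numpy as np


def softmax(z):
    # Numerically stable row-wise softmax.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def train_discriminative(X, labels, n_classes, steps=300, lr=0.1):
    """Fit a softmax classifier by gradient ascent on the mean
    conditional log-likelihood of `labels` given `X`."""
    n, d = X.shape
    W = np.zeros((d, n_classes))
    b = np.zeros(n_classes)
    Y = np.eye(n_classes)[labels]          # one-hot targets
    for _ in range(steps):
        P = softmax(X @ W + b)             # current posteriors
        G = (Y - P) / n                    # gradient w.r.t. the logits
        W += lr * (X.T @ G)
        b += lr * G.sum(axis=0)
    return W, b


def bootstrap_relabel(X, init_labels, n_classes, rounds=3):
    """Hypothetical bootstrapping loop: retrain on the current
    predicted labels, then re-predict, with no reference labels."""
    labels = init_labels.copy()
    for _ in range(rounds):
        W, b = train_discriminative(X, labels, n_classes)
        labels = softmax(X @ W + b).argmax(axis=1)
    return labels


# Toy data: two synthetic "speakers" as 2-D Gaussian blobs (assumption).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2.0, 1.0, (200, 2)),
               rng.normal(2.0, 1.0, (200, 2))])
true = np.array([0] * 200 + [1] * 200)

# Simulate an imperfect first-pass clustering by flipping about 20% of labels.
flip = rng.random(true.size) < 0.2
noisy = np.where(flip, 1 - true, true)

refined = bootstrap_relabel(X, noisy, n_classes=2)
err_before = float(np.mean(noisy != true))
err_after = float(np.mean(refined != true))
print(f"label error before: {err_before:.3f}, after: {err_after:.3f}")
```

The reference labels `true` are used only to score the result, never for training, which mirrors the unsupervised setting described above. In this toy setup the initial errors are random rather than systematic, which is a favorable case; real diarization errors need not behave this way.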


Bibliographic details

Source: Annual Conference of the International Speech Communication Association (INTERSPEECH 2010) | 2011 | pp. 2326-2329 | 4 pages
Authors: Oriol Vinyals; Gerald Friedland; Nelson Morgan
Original format: PDF
CLC classification: Communications
Keywords: discriminative learning; maximum mutual information; speaker diarization

