首页> 外文会议>European Signal Processing Conference >A novel method for selecting the number of clusters in a speaker diarization system
【24h】

A novel method for selecting the number of clusters in a speaker diarization system

机译:一种在说话人差异化系统中选择簇数的新方法

获取原文

摘要

This paper introduces the cluster score (C-score) as a measure for determining a suitable number of clusters when performing speaker clustering in a speaker diarization system. C-score finds a trade-off between intra-cluster and extra-cluster similarities, selecting a number of clusters with cluster elements that are similar between them but different to the elements in other clusters. Speech utterances are represented by Gaussian mixture model mean supervectors, and also the projection of the supervectors into a low-dimensional discriminative subspace by linear discriminant analysis is assessed. This technique shows robustness to segmentation errors and, compared with the widely used Bayesian information criterion (BIC)-based stopping criterion, results in a lower speaker clustering error and dramatically reduces computation time. Experiments were run using the broadcast news database used for the Albayzin 2010 Speaker Diarization Evaluation.
机译:本文介绍了聚类得分(C分数),作为在说话者区分系统中执行说话者聚类时确定合适数目的聚类的一种方法。 C分数在集群内相似性与集群外相似性之间找到了一个折衷,选择了多个集群,这些集群之间的集群元素相似但与其他集群中的元素不同。用高斯混合模型平均超向量表示语音,并通过线性判别分析评估超向量在低维判别子空间中的投影。该技术显示了对分割错误的鲁棒性,并且与广泛使用的基于贝叶斯信息标准(BIC)的停止标准相比,可以降低说话者聚类错误并显着减少计算时间。使用用于《 Albayzin 2010演讲者差异化评估》的广播新闻数据库进行实验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号