首页> 外文会议>IEEE International Conference on Rebooting Computing >Robust speaker clustering quality estimation
【24h】

Robust speaker clustering quality estimation

机译:健壮的说话人聚类质量估计

获取原文
获取外文期刊封面目录资料

摘要

This paper focuses on estimating the quality of a clustering process. In our case - the task is to cluster short speech segments that belong to different speakers. Moreover, speaker clustering quality may be well estimated on several clustering approaches if they all based on the same features. This is very important, as it allows us to use the same quality estimation system without retraining, and achieve reasonable results even when the clustering method is changed. We predict the system's quality by applying a logistic regression estimator on a several statistical parameters of the clustering. In this paper, mean-shift clustering with either cosine or probabilistic linear discriminant analysis (PLDA) score as similarity measure, and stochastic vector quantization (VQ) with cosine distance were applied in order to cluster the short speaker segments represented by i-vectors. The quality of the clustering is measured using the average cluster purity (ACP), average speaker purity (ASP) and K. We show that these measures can be estimated fairly well by applying logistic regression based on various clustering statistics that calculated once clustering is over. These statistical parameters are used as a feature vector representing the clustering.
机译:本文着重于评估聚类过程的质量。在我们的案例中,任务是将属于不同说话者的简短语音片段聚类。而且,如果说话者的聚类质量全部基于相同的特征,则可以在几种聚类方法上很好地估计它们。这非常重要,因为它允许我们使用相同的质量评估系统而无需重新训练,即使更改聚类方法,也可以获得合理的结果。我们通过对聚类的几个统计参数应用逻辑回归估计量来预测系统的质量。在本文中,采用余弦或概率线性判别分析(PLDA)得分作为相似性度量的均值漂移聚类,并使用具有余弦距离的随机矢量量化(VQ)进行聚类,以聚类由i-vector表示的短说话者片段。聚类的质量是使用平均聚类纯度(ACP),平均说话者纯度(ASP)和K来衡量的。我们证明,通过基于各种聚类统计量(对聚类结束后进行计算)进行对数回归,可以很好地估计这些指标。这些统计参数用作表示聚类的特征向量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号