Robust speaker clustering quality estimation

机译：健壮的说话人聚类质量估计

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper focuses on estimating the quality of a clustering process. In our case - the task is to cluster short speech segments that belong to different speakers. Moreover, speaker clustering quality may be well estimated on several clustering approaches if they all based on the same features. This is very important, as it allows us to use the same quality estimation system without retraining, and achieve reasonable results even when the clustering method is changed. We predict the system's quality by applying a logistic regression estimator on a several statistical parameters of the clustering. In this paper, mean-shift clustering with either cosine or probabilistic linear discriminant analysis (PLDA) score as similarity measure, and stochastic vector quantization (VQ) with cosine distance were applied in order to cluster the short speaker segments represented by i-vectors. The quality of the clustering is measured using the average cluster purity (ACP), average speaker purity (ASP) and K. We show that these measures can be estimated fairly well by applying logistic regression based on various clustering statistics that calculated once clustering is over. These statistical parameters are used as a feature vector representing the clustering.

机译：本文着重于评估聚类过程的质量。在我们的案例中，任务是将属于不同说话者的简短语音片段聚类。而且，如果说话者的聚类质量全部基于相同的特征，则可以在几种聚类方法上很好地估计它们。这非常重要，因为它允许我们使用相同的质量评估系统而无需重新训练，即使更改聚类方法，也可以获得合理的结果。我们通过对聚类的几个统计参数应用逻辑回归估计量来预测系统的质量。在本文中，采用余弦或概率线性判别分析（PLDA）得分作为相似性度量的均值漂移聚类，并使用具有余弦距离的随机矢量量化（VQ）进行聚类，以聚类由i-vector表示的短说话者片段。聚类的质量是使用平均聚类纯度（ACP），平均说话者纯度（ASP）和K来衡量的。我们证明，通过基于各种聚类统计量（对聚类结束后进行计算）进行对数回归，可以很好地估计这些指标。这些统计参数用作表示聚类的特征向量。

著录项

来源
《IEEE International Conference on Rebooting Computing》|2018年|1-5|共5页
会议地点
作者
Yishai Cohen; Itshak Lapidot;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering algorithms; Logistics; Estimation; Training; Clustering methods; Measurement; Feature extraction;

机译：聚类算法;物流;估计;训练;聚类方法;测量;特征提取;

相似文献

外文文献
中文文献
专利

1. Speaker clustering quality estimation with logistic regression [J] . Yishai Cohen, Itshak Lapidot Computer speech and language . 2021,第Jana期

机译：具有逻辑回归的扬声器聚类质量估计
2. Improving Robustness Of Mllr Adaptation With Speaker-clustered Regression Class Trees [J] . Arindam Mandal, Mari Ostendorf, Andreas Stolcke Computer speech and language . 2009,第2期

机译：说话者群集回归类树提高Mllr适应的鲁棒性
3. Robust Speaker Clustering Using Affinity Propagation [J] . Xiang ZHANG, Ping LU, Hongbin SUO, IEICE Transactions on Information and Systems . 2008,第11期

机译：使用相似性传播的强大说话人聚类
4. Robust speaker clustering quality estimation [C] . Yishai Cohen, Itshak Lapidot IEEE International Conference on Rebooting Computing . 2018

机译：强大的扬声器聚类质量估计
5. Robust speaker clustering under variation in data characteristics. [D] . Han, Kyu Jeong. 2009

机译：在数据特性变化的情况下，强大的扬声器群集。
6. Robust estimation of the probabilities of 3‐D clusters in functional brain images: Application to PET data [O] . Anders Ledberg 2000

机译：对功能性脑图像中3D簇的概率的可靠估计：在PET数据中的应用
7. Rescaling clustering trees using impact ratios for robust hierarchical speaker clustering [O] . Ghaemmaghami Houman, Dean David, Kalantari Shahram, 2014

机译：使用影响比重新缩放聚类树，以实现强大的分层说话者聚类

Robust speaker clustering quality estimation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅