Speakers clustering with stochastic VQ and clustering quality estimator

机译：带有随机VQ的说话人聚类和聚类质量估计器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Short segments speaker clustering has significant importance both for diarization and applications such as short push-to-tatk (PTT) segments clustering. In this paper we present a new way to cluster speech segments by applying a stochastic vector quantization (VQ) with a cosine metric together with a speaker clustering quality estimator based on logistic regression. The VQ is performed on codebooks of different sizes, and the choice of the best clustering result is estimated using logistic regression. The algorithm is tested on a large range of speakers, between 2 to 60. The results are compared to those of the mean-shift clustering method, which was already tested for this task several times. The results are a bit below those of the cosine similarity measure-based mean-shift clustering. The advantage is in the run-time which is approximately 10 times faster.

机译：短片段说话者聚类对于数字化和短按即说（PTT）短片段聚类等应用都具有重要意义。在本文中，我们提出了一种通过将带有余弦度量的随机向量量化（VQ）与基于逻辑回归的说话人聚类质量估计器一起应用来对语音片段进行聚类的新方法。在不同大小的码本上执行VQ，并使用逻辑回归估计最佳聚类结果的选择。该算法在2至60之间的大范围扬声器上进行了测试。结果与均值漂移聚类方法的结果进行了比较，均值漂移聚类方法已经为此任务进行了多次测试。结果比基于余弦相似性度量的均值漂移聚类结果要低一些。优点是运行时间快了大约10倍。

著录项

来源
《IEEE International Conference on Rebooting Computing》|2018年|1-5|共5页
会议地点
作者
Yishai Cohen; Itshak Lapidot;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Indexes; Logistics; Euclidean distance; Clustering algorithms; Training; Couplings; Electrical engineering;

机译：索引;物流;欧氏距离;聚类算法;培训;联轴器;电气工程;

相似文献

外文文献
中文文献
专利

1. Speaker Change Detection and Speaker Clustering Using VQ Distortion Measure [J] . Seiichi Nakagawa, Kazumasa Mori Systems and Computers in Japan . 2003,第13期

机译：使用VQ失真测量的说话人变化检测和说话人聚类
2. Online speaker change detection and speaker clustering using VQ distortion measure [J] . Kazumasa Mori, Kazumasa Yamamoto, Seiichi Nakagawa 電子情報通信学会技術研究報告. 音声. Speech . 2000,第137期

机译：在线扬声器使用VQ失真测量更改检测和扬声器聚类
3. Online speaker change detection and speaker clustering using VQ distortion measure [J] . Kazumasa Mori, Kazumasa Yamamoto, Seiichi Nakagawa 電子情報通信学会技術研究報告. 音声. Speech . 2000,第137期

机译：在线扬声器使用VQ失真测量更改检测和扬声器聚类
4. Speakers clustering with stochastic VQ and clustering quality estimator [C] . Yishai Cohen, Itshak Lapidot IEEE International Conference on Rebooting Computing . 2018

机译：扬声器与随机VQ和聚类质量估算器聚类
5. Efficient speaker recognition using speaker model clusters. [D] . Apsingekar, Vijendra Raj. 2009

机译：使用说话人模型集群进行有效的说话人识别。
6. On the use of robust estimators for standard errors in the presence of clustering when clustering membership is misspecified [O] . Manisha Desai, Susan W. Bryson, Thomas Robinson -1

机译：在群集成员身份的群集时在群集时使用鲁棒估算器的使用鲁棒估算器
7. Speaker Change Detection and Speaker Clustering Using VQ Distortion for Broadcast News Speech Recognition [O] . Kazumasa Mori, Seiichi Nakagawa 2001

机译：利用VQ失真进行广播新闻语音识别的扬声器变化检测和扬声器聚类

Speakers clustering with stochastic VQ and clustering quality estimator

摘要

著录项

相似文献

相关主题

期刊订阅