【24h】

Speaker Clustering Based on Non-negative Matrix Factorization

机译:基于非负矩阵分解的说话人聚类

获取原文

摘要

This paper addresses unsupervised speaker clustering for multiparty conversations. Hierarchical clustering methods were mainly used in previous studies. However, these methods require many processes, such as distance calculation and cluster merging, when there are many utterances in conversation data. We propose a clustering method based on non-negative matrix factorization. The proposed method can perform fast and robust clustering by decomposing a matrix consisting of distances between models. We conducted speaker clustering experiments using a Bayesian information criterion based method, a method based on the likelihood ratio between Gaussian mixture models, and the proposed method. Experimental results showed that the proposed method achieves higher clustering accuracy than these conventional methods.
机译:本文讨论了用于多方对话的无监督说话者聚类。分层聚类方法主要用于以前的研究中。但是,当会话数据中有许多语音时,这些方法需要许多过程,例如距离计算和聚类合并。我们提出了一种基于非负矩阵分解的聚类方法。所提出的方法可以通过分解由模型之间的距离组成的矩阵来执行快速且鲁棒的聚类。我们使用基于贝叶斯信息准则的方法,基于高斯混合模型之间的似然比的方法以及提出的方法进行了说话人聚类实验。实验结果表明,与传统方法相比,该方法具有更高的聚类精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号