首页> 外文会议>Odyssey 2010: the speaker and language recognition workshop >Speaker clustering via the mean shift algorithm
【24h】

Speaker clustering via the mean shift algorithm

机译:通过均值漂移算法进行说话人聚类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we investigate the use of the mean shift algorithm with respect to speaker clustering. The algorithm is an elegant nonparametric technique that has become very popular in image segmentation, video tracking and other image processing and computer vision tasks. Its primary aim is to detect the modes of the underlying density and consequently merge those observations being attracted by each mode. Since the number of modes is not needed to be known beforehand, the algorithm seems to fit well to the problem of speaker clustering. However, the algorithm needs to be adapted; the original algorithm acts on the space of observations, while speaker clustering algorithms act on the space of probabilistic parametric models. We attempt to adapt the algorithm, based on some basic concepts of information geometry, that are related to the exponential family of distributions.
机译:在本文中,我们针对说话人聚类研究了均值平移算法的使用。该算法是一种优雅的非参数技术,在图像分割,视频跟踪和其他图像处理以及计算机视觉任务中非常流行。它的主要目的是检测潜在密度的模式,并因此合并每种模式所吸引的那些观测值。由于不需要预先知道模式的数量,因此该算法似乎非常适合说话人聚类的问题。但是,需要对算法进行调整。原始算法作用于观测空间,而说话人聚类算法作用于概率参数模型空间。我们尝试根据信息几何的一些基本概念来调整算法,这些基本概念与指数分布族有关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号