【24h】

Clustering by Synchronization

机译:通过同步进行聚类

获取原文
获取外文期刊封面目录资料

摘要

Synchronization is a powerful basic concept in nature regulating a large variety of complex processes ranging from the metabolism in the cell to social behavior in groups of individuals. Therefore, synchronization phenomena have been extensively studied and models robustly capturing the dynamical synchronization process have been proposed, e.g. the Extensive Kuramoto Model. Inspired by the powerful concept of synchronization, we propose Sync, a novel approach to clustering. The basic idea is to view each data object as a phase oscillator and simulate the interaction behavior of the objects over time. As time evolves, similar objects naturally synchronize together and form distinct clusters. Inherited from synchronization, Sync has several desirable properties: The clusters revealed by dynamic synchronization truly reflect the intrinsic structure of the data set, Sync does not rely on any distribution assumption and allows detecting clusters of arbitrary number, shape and size. Moreover, the concept of synchronization allows natural outlier handling, since outliers do not synchronize with cluster objects. For fully automatic clustering, we propose to combine Sync with the Minimum Description Length principle. Extensive experiments on synthetic and real world data demonstrate the effectiveness and efficiency of our approach.
机译:同步是自然界中强有力的基本概念,它调节着各种各样复杂的过程,从细胞中的新陈代谢到个人群体的社会行为。因此,已经对同步现象进行了广泛的研究,并且提出了健壮地捕获动态同步过程的模型,例如,图3。广泛的仓本模型。受强大的同步概念启发,我们提出了Sync(一种新颖的集群方法)。基本思想是将每个数据对象视为一个相位振荡器,并随着时间的推移模拟对象的交互行为。随着时间的流逝,相似的对象自然会同步在一起并形成不同的簇。 Sync继承自同步,具有几个理想的属性:动态同步揭示的集群真正反映了数据集的固有结构,Sync不依赖于任何分布假设,并允许检测任意数量,形状和大小的集群。此外,同步的概念允许自然的离群值处理,因为离群值不与群集对象同步。对于全自动群集,我们建议将“同步”与“最小描述长度”原则结合在一起。对合成和真实世界数据进行的大量实验证明了我们方法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号