首页> 外文会议>International conference on advanced data mining and applications >Restoring: A Greedy Heuristic Approach Based on Neighborhood for Correlation Clustering
【24h】

Restoring: A Greedy Heuristic Approach Based on Neighborhood for Correlation Clustering

机译:恢复:基于相关聚类邻域的贪婪启发式方法

获取原文

摘要

Correlation Clustering has received considerable attention in machine learning literature due to its not requiring specifying the number of clusters in advance. Many approximation algorithms for Correlation Clustering have been proposed with worst-case theoretical guarantees, but with less experimental evaluations. These methods simply consider the direct associations between vertices and achieve poor performance in real datasets. In this paper, we propose a neighborhood-based method called Restoring, in which we argue that the neighborhood around two connected vertices is important and two vertices belonging to the same cluster should have the same neighborhood. Our algorithm iteratively chooses two connected vertices and restores their neighborhood. We also define the cost of keeping or removing one non-common neighbor and identify a restoring order based on the neighborhood similarity. Experiments conducted on five sub datasets of Cora show that our method performs better than existing well-known methods both in results quality and objective value.
机译:由于其不需要提前指定集群数量,相关聚类在机器学习文献中受到了相当大的关注。已经提出了许多用于相关聚类的近似算法,具有最坏的情况的理论保证,但实验性评估较少。这些方法只是考虑顶点之间的直接关联并在实时数据集中实现差的性能。在本文中,我们提出了一种称为恢复的基于邻域的方法,其中我们争论两个连接的顶点周围的邻域是重要的,并且属于同一群集的两个顶点应该具有相同的邻域。我们的算法迭代地选择两个连接的顶点并恢复它们的邻居。我们还定义了保持或删除一个非公共邻居的成本,并根据邻域的相似度识别恢复顺序。对Cora五个子数据集进行的实验表明,我们的方法在结果质量和客观价值中比现有的众所周知方法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号