首页> 外文会议>Conference on Neural Information Processing Systems >Correlation Clustering with Local Objectives
【24h】

Correlation Clustering with Local Objectives

机译:与当地目标的相关聚类

获取原文

摘要

Correlation Clustering is a powerful graph partitioning model that aims to cluster items based on the notion of similarity between items. An instance of the Correlation Clustering problem consists of a graph G (not necessarily complete) whose edges are labeled by a binary classifier as "similar" and "dissimilar". An objective which has received a lot of attention in literature is that of minimizing the number of disagreements: an edge is in disagreement if it is a "similar" edge and is present across clusters or if it is a "dissimilar" edge and is present within a cluster. Define the disagreements vector to be an n dimensional vector indexed by the vertices, where the v-th index is the number of disagreements at vertex v. Recently, Puleo and Milenkovic (ICML'16) initiated the study of the Correlation Clustering framework in which the objectives were more general functions of the disagreements vector. In this paper, we study algorithms for minimizing l_q norms (q ≥ 1) of the disagreements vector for both arbitrary and complete graphs. We present the first known algorithm for minimizing the l_q norm of the disagreements vector on arbitrary graphs and also provide an improved algorithm for minimizing the l_q norm (q ≥ 1) of the disagreements vector on complete graphs. We also study an alternate cluster-wise local objective introduced by Ahmadi, Khuller and Saha (IPCO '19), which aims to minimize the maximum number of disagreements associated with a cluster. We also present an improved (2 + ε)-approximation algorithm for this objective. Finally, we compliment our algorithmic results for minimizing the l_q norm of the disagreements vector with some hardness results.
机译:相关聚类是一个强大的图形分区模型,其旨在基于项目之间的相似性的概念群集项目。相关聚类问题的一个实例由图形g(不一定完成)组成,其边缘由二进制分类器标记为“类似”和“不同”。在文献中接受了很多关注的目标是最小化分歧的数量:如果它是“类似”的边缘,则边缘处于分歧状态,并且存在于簇中,或者它是“不同的”边缘并且存在在群集中。定义分列向量是由顶点索引的N维矢量,其中V-T索引是Vertex v的分歧次数。最近,Puleo和Milenkovic(ICML'16)启动了对相关聚类框架的研究目的是分歧向量的更一般的职能。在本文中,我们研究了用于最小化分歧向量的L_Q标准(Q≥1)的算法,用于任意和完整的图形。我们介绍了第一已知算法,用于最小化任意图表上的分列向量的L_Q标准,并且还提供了一种改进的算法,用于最小化分列矢量在完整图表上的L_Q常态(Q≥1)。我们还研究了Ahmadi,Khuller和Saha(IPCO'19)引入的替代聚类目的目标,旨在最大限度地减少与群集相关的最大分歧次数。我们还提出了一种改进的(2 +ε) - 销售算法。最后,我们赞扬我们的算法结果,以使分歧向量的L_Q标准与一些硬度结果最小化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号