首页> 外文期刊>Algorithmica >On the Advantage of Overlapping Clusters for Minimizing Conductance
【24h】

On the Advantage of Overlapping Clusters for Minimizing Conductance

机译:关于重叠群集的优势,以最小化电导

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Graph clustering is an important problem with applications to bioinformat-ics, community discovery in social networks, distributed computing, and more. While most of the research in this area has focused on clustering using disjoint clusters, many real datasets have inherently overlapping clusters. We compare overlapping and non-overlapping clusterings in graphs in the context of minimizing their conductance. It is known that allowing clusters to overlap gives better results in practice. We prove that overlapping clustering may be significantly better than non-overlapping clustering with respect to conductance, even in a theoretical setting. For minimizing the maximum conductance over the clusters, we give examples demonstrating that allowing overlaps can yield significantly better clusterings, namely, one that has much smaller optimum. In addition for the min-max variant, the overlapping version admits a simple approximation algorithm, while our algorithm for the non-overlapping version is complex and yields a worse approximation ratio due to the presence of the additional constraint. Somewhat surprisingly, for the problem of minimizing the sum of conductances, we found out that allowing overlap does not help. We show how to apply a general technique to transform any overlapping clustering into a non-overlapping one with only a modest increase in the sum of con-ductances. This uncrossing technique is of independent interest and may find further applications in the future. We consider this work as a step toward rigorous comparison of overlapping and non-overlapping clusterings and hope that it stimulates further research in this area.
机译:对于生物信息学,社交网络中的社区发现,分布式计算等应用,图集群是一个重要的问题。尽管该领域的大多数研究都集中在使用不相交的聚类进行聚类,但许多实际数据集具有内在重叠的聚类。我们在使电导最小的情况下比较图中的重叠和非重叠聚类。众所周知,在实践中允许群集重叠会产生更好的结果。我们证明,就电导而言,即使在理论上,重叠聚类也可能比非重叠聚类好得多。为了使群集上的最大电导最小,我们给出一些示例,说明允许重叠可以产生明显更好的群集,即,最佳群集要小得多。另外,对于min-max变体,重叠版本允许使用简单的近似算法,而我们的非重叠版本的算法则很复杂,并且由于存在附加约束,因此产生的近似率也较差。令人惊讶的是,对于最小化电导和的问题,我们发现允许重叠是没有帮助的。我们展示了如何应用一种通用技术将任何重叠的簇转换为不重叠的簇,而仅增加电导和。这种非交叉技术具有独立的意义,并且将来可能会发现更多的应用。我们认为这项工作是朝着严格比较重叠和非重叠群集迈出的一步,并希望它能激发这一领域的进一步研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号