首页> 外文会议>Advances in knowledge discovery and data mining >Efficient Mining of Combined Subspace and Subgraph Clusters in Graphs with Feature Vectors
【24h】

Efficient Mining of Combined Subspace and Subgraph Clusters in Graphs with Feature Vectors

机译:特征向量图中有效组合子空间和子图簇的挖掘

获取原文
获取原文并翻译 | 示例

摘要

Large graphs are ubiquitous in today's applications. Besides the mere graph structure, data sources usually provide information about single objects by feature vectors. To realize the full potential for knowledge extraction, recent approaches consider both information types simultaneously. Thus, for the task of clustering, combined clustering models determine object groups within one network that are densely connected and show similar characteristics. However, due to the inherent complexity of such a combination, the existing methods are not efficiently executable and are hardly applicable to large graphs. In this work, we develop a method for an efficient clustering of combined data sources, while at the same time finding high-quality results. We prove the complexity of our model and identify the critical parts inhibiting an efficient execution. Based on this analysis, we develop the algorithm EDCAR that approximates the optimal clustering solution using the established GRASP (Greedy Randomized Adaptive Search) principle. In thorough experiments we show that EDCAR outperforms all competing approaches in terms of runtime and simultaneously achieves high clustering qualities. For repeatability and further research we publish all datasets, executables and parameter settings on our website.
机译:大图在当今的应用程序中无处不在。除了单纯的图形结构外,数据源通常还通过特征向量提供有关单个对象的信息。为了实现知识提取的全部潜力,最近的方法同时考虑了两种信息类型。因此,对于群集的任务,组合的群集模型确定一个网络中密集连接并显示相似特征的对象组。然而,由于这种组合的固有复杂性,现有方法不能有效地执行,并且几乎不适用于大型图。在这项工作中,我们开发了一种对组合数据源进行有效聚类的方法,同时可以找到高质量的结果。我们证明了模型的复杂性,并确定了阻碍有效执行的关键部分。基于此分析,我们使用已建立的GRASP(贪婪随机自适应搜索)原理,开发了一种近似最佳聚类解决方案的算法EDCAR。在全面的实验中,我们证明了EDCAR在运行时方面优于所有竞争方法,并同时实现了高聚类质量。为了重复性和进一步研究,我们在我们的网站上发布了所有数据集,可执行文件和参数设置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号