首页> 外文会议>ACM SIGMOD international conference on management of data >Towards Proximity Pattern Mining in Large Graphs
【24h】

Towards Proximity Pattern Mining in Large Graphs

机译:在大图中迈向近距离模式

获取原文

摘要

Mining graph patterns in large information networks is critical to a variety of applications such as malware detection and biological module discovery. However, frequent subgraphs are often ineffective to capture association existing in these applications, due to the complexity of isomorphism testing and the inelastic pattern definition. In this paper, we introduce proximity pattern which is a significant departure from the traditional concept of frequent subgraphs. Defined as a set of labels that co-occur in neighborhoods, proximity pattern blurs the boundary between itemset and structure. It relaxes the rigid structure constraint of frequent subgraphs, while introducing connectivity to frequent itemsets. Therefore, it can benefit from both: efficient mining in itemsets and structure proximity from graphs. We developed two models to define proximity patterns. The second one, called Normalized Probabilistic Association (NmPA), is able to transform a complex graph mining problem to a simplified probabilistic itemset mining problem, which can be solved efficiently by a modified FP-tree algorithm, called pFP. NmPA and pFP are evaluated on real-life social and intrusion networks. Empirical results show that it not only finds interesting patterns that are ignored by the existing approaches, but also achieves high performance for finding proximity patterns in large-scale graphs.
机译:大型信息网络中的挖掘图形模式对于多种应用,如恶意软件检测和生物模块发现是至关重要的。然而,由于同构检测的复杂性和非弹性模式定义,频繁的子图通常无效地捕获这些应用中存在的关联。在本文中,我们引入了近似模式,这是一种从传统频繁子图的概念的重要偏离。定义为在邻域中共同发生的一组标签,邻近模式模糊项目集和结构之间的边界。它放松了频繁子图的刚性结构约束,同时引入频繁项目集的连接。因此,它可以从两者中受益:在项目集中有效挖掘和从图形的结构接近。我们开发了两个模型来定义接近模式。第二个称为归一化概率关联(NMPA),能够将复杂的图形挖掘问题转换为简化的概率算法挖掘问题,其可以通过调制的FP树算法有效地解决,称为PFP。 NMPA和PFP在现实社会和入侵网络上进行评估。经验结果表明,它不仅找到了现有方法忽略的有趣模式,而且还可以实现大规模图表中查找近距离模式的高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号