首页> 外文期刊>Data & Knowledge Engineering >Mining globally distributed frequent subgraphs in a single labeled graph
【24h】

Mining globally distributed frequent subgraphs in a single labeled graph

机译:在单个标签图中挖掘全局分布的频繁子图

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Recent years have observed increasing efforts on graph mining and many algorithms have been developed for this purpose. However, most of the existing algorithms are designed for discovering frequent subgraphs in a set of labeled graphs only. Also, the few algorithms that find frequent subgraphs in a single labeled graph typically identify subgraphs appearing regionally in the input graph. In contrast, for real-world applications, it is commonly required that the identified frequent subgraphs in a single labeled graph should also be globally distributed. This paper thus fills this crucial void by proposing a new measure, termed G-Measure, to find globally distributed frequent subgraphs, called G-Patterns, in a single labeled graph. Specifically, we first show that the G-Patterns, selected by G-Measure, tend to be globally distributed in the input graph. Then, we present that G-Measure has the downward closure property, which guarantees the G-Measure value of a G-Pattern is not less than those of its supersets. Consequently, a G-Miner algorithm is developed for finding G-Patterns. Experimental results on four synthetic and seven real-world data sets and comparison with the existing algorithms demonstrate the efficacy of the G-Measure and the G-Miner for finding G-Patterns. Finally, an application of the G-Patterns is given.
机译:近年来,人们观察到了在图挖掘方面不断增加的努力,并且为此目的开发了许多算法。但是,大多数现有算法仅用于发现一组标记图中的频繁子图。同样,在单个带标签的图中找到频繁的子图的少数算法通常会标识在输入图中局部出现的子图。相反,对于现实世界的应用程序,通常要求在单个标记图中标识的频繁子图也应该全局分布。因此,本文通过提出一种称为G-Measure的新方法来填补这一关键空白,以在单个带标签的图中找到全局分布的频繁子图,称为G-Patterns。具体来说,我们首先表明,由G-Measure选择的G模式倾向于在输入图中全局分布。然后,我们提出G-Measure具有向下封闭的性质,这保证了G-Pattern的G-Measure值不小于其超集的G-Measure值。因此,开发了一种G-Miner算法来查找G模式。在四个合成和七个真实数据集上进行的实验结果以及与现有算法的比较证明,G-Measure和G-Miner可以找到G模式。最后,给出了G模式的应用。

著录项

  • 来源
    《Data & Knowledge Engineering》 |2009年第10期|1034-1058|共25页
  • 作者单位

    School of Computer Engineering, Nanyang Technological University. Singapore 639798;

    Management Science and Information Systems Department, Rutgers University, NJ 07102, USA;

    IBM China Research Laboratory, Beijing 100094, China;

    School of Computer Engineering, Nanyang Technological University. Singapore 639798;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    frequent subgraph mining; G-Measure; G-Pattern;

    机译:频繁的子图挖掘;G-措施;G模式;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号