首页> 外文会议>ACM SIGKDD international conference on Knowledge discovery in data mining >Discovering frequent topological structures from graph datasets
【24h】

Discovering frequent topological structures from graph datasets

机译:从图数据集中发现频繁的拓扑结构

获取原文

摘要

The problem of finding frequent patterns from graph-based datasets is an important one that finds applications in drug discovery, protein structure analysis, XML querying, and social network analysis among others. In this paper we propose a framework to mine frequent large-scale structures, formally defined as frequent topological structures, from graph datasets. Key elements of our framework include, fast algorithms for discovering frequent topological patterns based on the well known notion of a topological minor, algorithms for specifying and pushing constraints deep into the mining process for discovering constrained topological patterns, and mechanisms for specifying approximate matches when discovering frequent topological patterns in noisy datasets. We demonstrate the viability and scalability of the proposed algorithms on real and synthetic datasets and also discuss the use of the framework to discover meaningful topological structures from protein structure data.
机译:从基于图的数据集中查找频繁模式的问题是一个重要的问题,它可以发现药物发现,蛋白质结构分析,XML查询以及社交网络分析等方面的应用。在本文中,我们提出了一个框架,用于从图数据集中挖掘频繁的大规模结构,该结构正式定义为频繁的拓扑结构。我们框架的关键要素包括:基于众所周知的拓扑未成年人概念的快速算法,用于发现频繁的拓扑模式;用于指定约束并将约束推入挖掘过程中以发现受约束的拓扑模式的算法;以及用于在发现时指定近似匹配项的机制嘈杂数据集中的常见拓扑模式。我们展示了在真实和合成数据集上提出的算法的可行性和可扩展性,还讨论了使用框架从蛋白质结构数据中发现有意义的拓扑结构的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号