首页> 外文期刊>Information Systems >Discovering frequent geometric subgraphs
【24h】

Discovering frequent geometric subgraphs

机译:发现频繁的几何子图

获取原文
获取原文并翻译 | 示例
           

摘要

Data mining-based analysis methods are increasingly being applied to data sets derived from science and engineering domains that model various physical phenomena and objects. In many of these data sets, a key requirement for their effective analysis is the ability to capture the relational and geometric characteristics of the underlying entities and objects. Geometric graphs, by modeling the various physical entities and their relationships with vertices and edges, provide a natural method to represent such data sets. In this paper we present gFSG, a computationally efficient algorithm for finding frequent patterns corresponding to geometric subgraphs in a large collection of geometric graphs. gFSG is able to discover geometric subgraphs that can be rotation, scaling, and translation invariant, and it can accommodate inherent errors on the coordinates of the vertices. We evaluated its performance using a large database of over 20,000 chemical structures, and our results show that it requires relatively little time, can accommodate low support values, and scales linearly with the number of transactions.
机译:基于数据挖掘的分析方法越来越多地应用于从科学和工程领域建模各种物理现象和物体的数据集。在许多这样的数据集中,对其有效分析的关键要求是能够捕获基础实体和对象的关系和几何特征。通过对各种物理实体及其与顶点和边的关系进行建模,几何图提供了一种表示此类数据集的自然方法。在本文中,我们介绍了gFSG,这是一种计算有效的算法,用于查找与大量几何图形集合中的几何子图形相对应的频繁模式。 gFSG能够发现可能是旋转,缩放和平移不变的几何子图,并且它可以容纳顶点坐标上的固有误差。我们使用包含20,000多种化学结构的大型数据库评估了它的性能,结果表明,它需要相对较少的时间,可以容纳较低的支持价值,并且可以随着交易数量线性地扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号