首页> 外文会议>Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining >Cl-GBI: A Novel Approach for Extracting Typical Patterns from Graph-Structured Data
【24h】

Cl-GBI: A Novel Approach for Extracting Typical Patterns from Graph-Structured Data

机译:CL-GBI:一种从图形结构数据中提取典型模式的新方法

获取原文

摘要

Graph-Based Induction (GBI) is a machine learning technique developed for the purpose of extracting typical patterns from graph-structured data by stepwise pair expansion (pair-wise chunking). GBI is very efficient because of its greedy search strategy, however, it suffers from the problem of overlapping subgraphs. As a result, some of typical patterns cannot be discovered by GBI though a beam search has been incorporated in an improved version of GBI called Beam-wise GBI (B-GBI). In this paper, improvement is made on the search capability by using a new search strategy, where frequent pairs are never chunked but used as pseudo nodes in the subsequent steps, thus allowing extraction of overlapping subgraphs. This new algorithm, called Cl-GBI (Chunking-less GBI), was tested against two datasets, the promoter dataset from UCI repository and the hepatitis dataset provided by Chiba University, and shown successful in extracting more typical patterns than B-GBI.
机译:基于图形的感应(GBI)是一种机器学习技术,用于通过逐步对扩展(一对明示块)从图形结构数据中提取典型模式。由于其贪婪的搜索策略,GBI非常有效,但是它存在重叠子图的问题。结果,GBI不能发现一些典型的模式,尽管光束搜索已被包含在称为光束 - WISE GBI(B-GBI)的GBI版本的改进版本中。在本文中,通过使用新的搜索策略,在搜索能力上进行改进,其中频繁对从不划分,而是用作后续步骤中的伪节点,从而允许提取重叠的子图。这种称为CL-GBI(少块GBI)的新算法对来自UCI储存库的推进者数据集和千叶大学提供的肝炎数据集进行了测试,并在提取比B-GBI更加典型的图案中取得成功。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号