首页> 外文会议>7th Pacific Rim International Conference on Artificial Intelligence, Aug 18-22, 2002, Tokyo, Japan >Knowledge Discovery from Structured Data by Beam-Wise Graph-Based Induction
【24h】

Knowledge Discovery from Structured Data by Beam-Wise Graph-Based Induction

机译:基于波束明智图的归纳法从结构化数据中发现知识

获取原文
获取原文并翻译 | 示例

摘要

A machine learning technique called Graph-Based Induction (GBI) extracts typical patterns from graph data by stepwise pair expansion (pairwise chunking). Because of its greedy search strategy, it is very efficient but suffers from incompleteness of search. We improved its search capability without imposing much computational complexity by incorporating the idea of beam search. Additional improvement is made to extract patterns that are more discriminative than those simply occurring frequently, and to enumerate identical patterns accurately based on the notion of canonical labeling. This new algorithm was implemented (now called Beam-wise GBI, B-GBI for short) and tested against a DNA data set from UCI repository. Since DNA data is a sequence of symbols, representing each sequence by attribute-value pairs by simply assigning these symbols to the values of ordered attributes does not make sense. By transforming the sequence into a graph structure and running B-GBI it is possible to extract discriminative substructures. These can be new attributes for a classification problem. Effect of beam width on the number of discovered attributes and predictive accuracy was evaluated, together with extracted characteristic subsequences, and the results indicate the effectiveness of B-GBI.
机译:一种称为基于图的归纳(GBI)的机器学习技术通过逐步对扩展(成对分块)从图数据中提取典型模式。由于其贪婪的搜索策略,它非常有效,但存在搜索不完整的问题。通过合并波束搜索的思想,我们在不增加太多计算复杂性的情况下提高了其搜索能力。进行了其他改进,以提取比简单频繁出现的模式更具区别性的模式,并基于规范标记的概念准确枚举相同的模式。实施了这种新算法(现在简称为Beam-wise GBI,简称B-GBI),并针对UCI存储库中的DNA数据集进行了测试。由于DNA数据是符号序列,因此仅通过将这些符号分配给有序属性的值来用属性值对表示每个序列是没有意义的。通过将序列转换为图结构并运行B-GBI,可以提取出判别性子结构。这些可能是分类问题的新属性。评估了波束宽度对发现的属性数量和预测精度的影响,以及提取的特征子序列,结果表明了B-GBI的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号