Constructing a Decision Tree for Graph-Structured Data and its Applications

Warodom Geamsakul; Tetsuya Yoshida; Kouzou Ohara; Hiroshi Motoda; Hideto Yokoi; Katsuhiko Takabayashi

首页> 外文期刊>Fundamenta Informaticae >Constructing a Decision Tree for Graph-Structured Data and its Applications

【24h】

Constructing a Decision Tree for Graph-Structured Data and its Applications

机译：图结构化数据的决策树的构建及其应用

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A machine learning technique called Graph-Based Induction (GBI) efficiently extracts typical patterns from graph-structured data by stepwise pair expansion (pairwise chunking). It is very efficient because of its greedy search. Meanwhile, a decision tree is an effective means of data classification from which rules that are easy to understand can be obtained. However, a decision tree could not be constructed for the data which is not explicitly expressed with attribute-value pairs. This paper proposes a method called Decision Tree Graph-Based Induction (DT-GBI), which constructs a classifier (decision tree) for graph-structured data while simultaneously constructing attributes for classification using GBI. Substructures (patterns) are extracted at each node of a decision tree by stepwise pair expansion in GBI to be used as attributes for testing. Since attributes (features) are constructed while a classifier is being constructed, DT-GBI can be conceived as a method for feature construction. The predictive accuracy of a decision tree is affected by which attributes (patterns) are used and how they are constructed. A beam search is employed to extract good enough discriminative patterns within the greedy search framework. Pessimistic pruning is incorporated to avoid overfitting to the training data. Experimenls using a DNA dalasel were conducted to see the effect of the beam width and the number of chunking at each node of a decision tree. The resulls indicate that DT-GBI thal uses very little prior domain knowledge can construct a decision tree that is comparable to other classifiers constructed using the domain knowledge. DT-GBI was also applied to analyze a real-world hepatitis dataset as a part of evidence-based medicine. Four classification tasks of the hepatitis data were conducted using only the time-series data of blood inspection and urinalysis. The preliminary results of experiments, both constructed decision trees and their predictive accuracies as well as extracted patterns, are reported in this paper. Some of the patterns match domain experts' experience and the overall results are encouraging.

机译：一种称为基于图的归纳（GBI）的机器学习技术可以通过逐步对扩展（成对分块）从图结构化数据中高效提取典型模式。由于贪婪的搜索，它非常有效。同时，决策树是一种有效的数据分类手段，从中可以获得易于理解的规则。但是，无法为未使用属性值对明确表示的数据构建决策树。本文提出了一种称为决策树基于图的归纳法（DT-GBI），该方法构造了图结构数据的分类器（决策树），同时使用GBI构造了用于分类的属性。通过在GBI中逐步扩展对，可以在决策树的每个节点上提取子结构（模式），以用作测试的属性。由于在构造分类器时构造了属性（特征），因此可以将DT-GBI视为构造特征的方法。决策树的预测准确性受使用的属性（模式）及其构造方式的影响。波束搜索用于在贪婪搜索框架内提取足够好的判别模式。引入了悲观修剪，以避免过度拟合训练数据。进行了使用DNA弹射的实验，以了解光束宽度和决策树每个节点处的分块数量的影响。结果表明DT-GBI使用很少的现有领域知识即可构建与使用该领域知识构建的其他分类器可比的决策树。 DT-GBI还被用于分析现实世界的肝炎数据集，作为循证医学的一部分。仅使用血液检查和尿液分析的时间序列数据来执行肝炎数据的四个分类任务。本文报道了实验的初步结果，包括构造的决策树及其预测精度以及提取的模式。一些模式与领域专家的经验相匹配，总体结果令人鼓舞。

著录项

来源
《Fundamenta Informaticae 》 |2005年第2期| p.131-160| 共30页
作者
Warodom Geamsakul; Tetsuya Yoshida; Kouzou Ohara; Hiroshi Motoda; Hideto Yokoi; Katsuhiko Takabayashi;
展开▼
作者单位

Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术 ;
关键词
graph mining; graph-based induction; decision tree; beam search; evidence-based medicine;

机译：图挖掘;基于图的归纳;决策树;波束搜索;循证医学;

相似文献

外文文献
中文文献
专利

1. A distributed placement service for graph-structured and tree-structured data [J] . Buehrer Gregory, Parthasarathy Srinivasan, Tatikonda Shirish ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2010 ,第5期

机译：用于图结构和树结构数据的分布式放置服务
2. A distributed placement service for graph-structured and tree-structured data [J] . Buehrer Gregory, Parthasarathy Srinivasan, Tatikonda Shirish ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2010 ,第5期

机译：用于图结构和树结构数据的分布式放置服务
3. Usefulness of a decision tree model for the analysis of adverse drug reactions: Evaluation of a risk prediction model of vancomycin‐associated nephrotoxicity constructed using a data mining procedure [J] . Imai Shungo, Yamada Takehiro, Kasashi Kumiko, Journal of evaluation in clinical practice . 2017 ,第6期

机译：用于分析不良药物反应的决策树模型的有用性：使用数据采矿过程构建的万古霉素相关肾毒性风险预测模型的评价
4. Constructing a Decision Tree for Graph-Structured Data and its Applications [C] . Warodom Geamsakul, Tetsuya Yoshida, Kouzou Ohara, Advances in Mining Graphs, Trees and Sequences . 2005

机译：构建图形结构数据及其应用的决策树
5. Development of Enhanced Neural Decision Tree Model and Application of Data Mining Techniques for Modeling of Petroleum Datasets [D] . ?Omer, Sardar Usman 2014

机译：增强神经决策树模型的发展与数据挖掘技术应用于石油数据集建模
6. An expert-guided decision tree construction strategy: an application in knowledge discovery with medical databases. [O] . Y. S. Tsai, P. H. King, M. S. Higgins, 1997

机译：专家指导的决策树构建策略：医学数据库在知识发现中的应用。
7. Adding Logical Operators to Tree Pattern Queries on Graph-Structured Data [O] . Zeng, Qiang, Jiang, Xiaorui, Zhuge, Hai 2012

机译：将逻辑运算符添加到图结构化的树模式查询中数据

Constructing a Decision Tree for Graph-Structured Data and its Applications

摘要

著录项

相似文献

相关主题

期刊订阅