Mining Chemical Compound Structure Data Using Inductive Logic Programming

机译：采用电感逻辑编程的化学复合结构数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Discovering knowledge from chemical compound structure data is a challenge task in KDD. It aims to generate hypotheses describing activities or characteristics of chemical compounds from their own structures. Since each compound composes of several parts with complicated relations among them, traditional mining algorithms cannot handle this kind of data efficiently. In this research, we apply Inductive Logic Programming (ILP) for classifying chemical compounds. ILP provides comprehensibility to learning results and capability to handle more complex data consisting of their relations. Nevertheless, the bottleneck for learning first-order theory is enormous hypothesis search space which causes inefficient performance by the existing learning approaches compared to the propositional approaches. We introduces an improved ILP approach capable of handling more efficiently a kind of data called multiple-part data, i.e., one instance of data consists of several parts as well as relations among parts. The approach tries to find hypothesis describing class of each training example by using both individual and relational characteristics of its part which is similar to finding common substructures among the complex relational instances. Chemical compound data is multiple-part data. Each compound is composed of atoms as parts, and various kinds of bond as relations among atoms. We then apply the proposed algorithm for chemical compound structure by conducting experiments on two real-world datasets: mutagenicity in nitroaromatic compounds and dopamine antagonist compounds. The experiment results were compared to the previous approaches in order to show the performance of proposed approach.

机译：从化合物的结构数据发现知识是在KDD一个挑战的任务。它的目的是生成描述从自身结构的活动或化合物的特性假说。由于几部分与它们之间关系复杂每种化合物组成，传统的挖掘算法不能有效地处理这种类型的数据。在这项研究中，我们申请的化合物进行分类归纳逻辑程序设计（ILP）。 ILP提供可理解性学习成果和能力来处理由他们的关系更复杂的数据。然而，对于学习一阶理论的瓶颈是导致由现有的学习效率低下的表现方法相比，在命题的办法巨大假设搜索空间。我们引入了能够处理更有效的一种叫做多部分数据的数据，即数据的一个实例由几部分组成，以及零件之间的关系的改进ILP方法。该方法尝试通过使用其一部分，其类似于在复杂的关系实例之间找到共同子结构包括个人和关系特性找到假说描述类每个训练样例的。化学化合物数据是多部分数据。每个化合物由原子作为部件，以及各种键的作为原子间关系的。致突变性硝基芳烃和多巴胺拮抗剂化合物：然后，我们进行实验的两个真实世界的数据集应用算法的化合物结构。实验结果进行了比较，以前的方法，以表明该方法的性能。

著录项

来源
《International Workshop on Active Mining》|2005年||共20页
会议地点
作者
Cholwich Nattee; Sukree Sinthupinyo; Masayuki Numao; Takashi Okada;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据处理、数据处理系统;
关键词

相似文献

外文文献
中文文献
专利

1. Distance-based Heuristics in Inductive Logic Programming for Mining from Chemical Compound Data [J] . Cholwich NATTEE, Sukree SINTHUPINYO, Masayuki NUMAO, 電子情報通信学会技術研究報告. 人工知能と知識処理. Artificial Intelligence and Knowledge Based Processing . 2003,第306期

机译：从化合物数据中挖掘的基于逻辑的启发式逻辑编程
2. Distance-based Heuristics in Inductive Logic Programming for Mining from Chemical Compound Data [J] . Cholwich NATTEE, Sukree SINTHUPINYO, Masayuki NUMAO, 電子情報通信学会技術研究報告. 人工知能と知識処理. Artificial Intelligence and Knowledge Based Processing . 2003,第306期

机译：化学复合数据采矿诱导逻辑规划中的距离的启发式
3. Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds [J] . Edward O. Cannon, Ata Amini, Andreas Bender, Journal of Computer-Aided Molecular Design . 2007,第5期

机译：支持向量归纳逻辑编程的性能优于朴素贝叶斯分类器和归纳逻辑编程，可用于生物活性化合物的分类
4. Mining Chemical Compound Structure Data Using Inductive Logic Programming [C] . Cholwich Nattee, Sukree Sinthupinyo, Masayuki Numao, International Workshop on Active Mining . 2005

机译：采用电感逻辑编程的化学复合结构数据
5. Integrating top-down and bottom-up approaches in inductive logic programming: Applications in natural language processing and relational data mining. [D] . Tang, Lap Poon Rupert. 2003

机译：在归纳逻辑编程中集成了自上而下和自下而上的方法：在自然语言处理和关系数据挖掘中的应用。
6. An Inductive Logic Programming Approach to Validate Hexose Binding Biochemical Knowledge [O] . Houssam Nassif, Hassan Al-Ali, Sawsan Khuri, -1

机译：验证六角绑定生物化学知识的感应逻辑编程方法
7. Learning logic programs with structured background knowledge☆☆An extended abstract of this paper appeared in: L. De Raedt (Ed.), Proceedings of the Fifth International Workshop on Inductive Logic Programming, Tokyo, Japan, 1995, pp. 53–76, Scientific Report of the Department of Computer Science, Katholieke Universiteit Leuven, and also in the post-conference volume: L. De Raedt (Ed.), Advances in Inductive Logic Programming, IOS Press, Amsterdam/Ohmsha, Tokyo, 1996, pp. 172–191. [O] . Horváth Tamás, Turán György 2001

机译：具有结构化背景知识的学习逻辑程序☆☆本文的扩展摘要发表在：L.De Raedt（Ed。），第五届国际归纳逻辑编程研讨会论文集，日本东京，1995年，第53-76页，鲁汶大学（Keholieke Leuven）计算机科学系的科学报告，以及会议后论文集：L.De Raedt（Ed。），归纳逻辑程序设计进展，IOS出版社，阿姆斯特丹/欧姆沙，东京，1996年，第pp。 172–191。
8. Relational Data Mining with Inductive Logic Programming for Link Discovery [R] . Mooney, R. J., Melville, P., Tang, L. R., 2002

机译：用于链路发现的归纳逻辑编程的关系数据挖掘

Mining Chemical Compound Structure Data Using Inductive Logic Programming

摘要

著录项

相似文献

相关主题

期刊订阅