关系tri-training:利用无标记数据学习一阶规则

李艳娟; 郭茂祖

首页> 中文期刊> 《计算机科学与探索》 >关系tri-training:利用无标记数据学习一阶规则

关系tri-training:利用无标记数据学习一阶规则

AI论文写作 >>

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

针对目前归纳逻辑程序设计(inductive logic programming,ILP)系统要求训练数据充分且无法利用无标记数据的不足,提出了一种利用无标记数据学习一阶规则的算法——关系tri-training(relational-tri-training,R-tri-training)算法.该算法将基于命题逻辑表示的半监督学习算法tri-training的思想引入到基于一阶逻辑表示的ILP系统,在ILP框架下研究如何利用无标记样例信息辅助分类器训练.R-tri-training算法首先根据标记数据和背景知识初始化三个不同的ILP系统,然后迭代地用无标记样例对三个分类器进行精化,即如果两个分类器对一个无标记样例的标记结果一致,则在一定条件下该样例将被标记给另一个分类器作为新的训练样例.标准数据集上实验结果表明:R-tri-training能有效地利用无标记数据提高学习性能,且R-tri-training算法性能优于GILP(genetic inductive logic programming)、NFOIL、KFOIL和ALEPH.%For the current inductive logic programming (ILP) system, the sufficient training datasets are required and the unlabeled data cannot be used. To solve this limitation, this paper introduces a first-order rule-learning algorithm exploiting the unlabeled data, named relational-tri-training (R-tri-training). This algorithm combines the tri-training based on propositional logic representation and ILP based on first-order logic representation, investigates the issue how to improve the performance of classifiers using the unlabeled data under the framework of ILP. Three different ILP systems are initialized according to the labeled data and the background knowledge, and then the three classifiers are refined by iteratively using the unlabeled data. That is, under special condition, the unlabeled data are going to be labeled to one classifier as the new training data when the same labeled results are given by the other two classifiers. Experimental results on the well-known benchmarks show that R-tri-training can effectively enhance the learning performance by exploiting the unlabeled data, and the performance of R-tri-training is better than genetic inductive logic programming (GILP), NFOIL, KFOIL and ALEPH.

著录项

来源
《计算机科学与探索》 |2012年第5期|430-442|共13页
作者
李艳娟; 郭茂祖;
展开▼
作者单位

哈尔滨工业大学计算机科学与技术学院;

哈尔滨150001;

东北林业大学信息与计算机工程学院;

哈尔滨150040;

哈尔滨工业大学计算机科学与技术学院;

哈尔滨150001;

展开▼
原文格式 PDF
正文语种 chi
中图分类人工智能理论;
关键词
机器学习; 归纳逻辑程序设计(ILP); 关系tri-training; 概率近似正确(PAC)可学习;

相似文献

中文文献
外文文献
专利

1. 利用产量特殊配合力数据确定28个热带玉米开放授粉品种的杂种优势组和利用RAPD标记确定其系统发育关系 [J] . S.N.Parentoni ,邱敦莲 ,等 . 国外作物育种 . 2002,第3期
2. 利用语义网技术实现铁路交通的地理语义查询(二)——从关系数据库中创建本体与定义推理规则 [J] . 董志 . 电脑编程技巧与维护 . 2013,第13期
3. 利用关系数据库实现无模式XML数据管理平台 [J] . 陈睿 ,林广艳 . 计算机工程与设计 . 2005,第1期
4. 无标记数据学习及其在图像检索中的应用 [J] . 武永成 . 软件导刊 . 2013,第3期
5. 半监督学习中非标记数据的利用 [J] . 罗进 ,周学君 . 湖北大学学报（自然科学版） . 2008,第1期
6. 基于Tri-Training的事件关系分类方法研究 [C] . DING Siyuan ,丁思远 ,HONG Yu . 中国中文信息学会2015学术年会（CIPS2015）暨第十四届全国计算语言学学术会议（CCL2015）、第三届基于自然标注大数据的自然语言处理国际学术研讨会（NLP-NABD2015） . 2015
7. 面向无标记数据和相变的归纳逻辑程序设计学习算法 [A] . 李艳娟 . 2012

关系tri-training:利用无标记数据学习一阶规则

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅