Conference: World Congress on Information and Communication Technologies

A preliminary study on the reuse of subtrees within decision trees in a genetic programming context for data classification


Abstract

Genetic programming (GP) has been successful in creating data classification models that achieve high accuracy. In programming, creating functions is common practice because it isolates a piece of code that can be reused. The encapsulation genetic operator promotes modularization in a similar way: it encapsulates subtrees, which GP trees can then reuse during the execution of the algorithm. Models created for data classification problems tend to be large and complex, which creates a need for modular acquisition methods that promote the reuse of existing subtrees when solving classification problems. The effect of the encapsulation operator in GP on data classification problems has not previously been investigated. Two approaches were proposed. The first incorporated the encapsulation operator with no limitations on how the encapsulated subtrees are used. The second made use of a maintained list of encapsulated subtrees. The two proposed methods were tested on eight data sets, and the results show that the encapsulation operator improved the training accuracy on nearly every data set.
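
To make the idea of encapsulation concrete, the following is a minimal sketch and not the authors' implementation. It assumes a decision tree stored as simple `Node` objects and loosely mirrors the second approach by keeping a list of encapsulated subtrees. The names `Node`, `frozen`, `encapsulate`, and `reuse` are hypothetical, and the abstract does not specify how subtrees are selected or reinserted, so uniform random selection is used here purely for illustration.

```python
import copy
import random
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    label: str                                   # attribute test or class label
    children: List["Node"] = field(default_factory=list)
    frozen: bool = False                         # True once encapsulated (treated as atomic)


def internal_nodes(tree: Node) -> List[Node]:
    """Collect non-leaf nodes that are not inside an encapsulated subtree."""
    found, stack = [], [tree]
    while stack:
        node = stack.pop()
        if node.frozen:
            continue                             # do not look inside a module
        if node.children:
            found.append(node)
            stack.extend(reversed(node.children))
    return found


def encapsulate(tree: Node, library: List[Node], rng: random.Random) -> Optional[Node]:
    """Freeze one randomly chosen subtree and store a copy in the maintained list."""
    candidates = internal_nodes(tree)
    if not candidates:
        return None
    chosen = rng.choice(candidates)
    chosen.frozen = True
    library.append(copy.deepcopy(chosen))
    return chosen


def reuse(tree: Node, library: List[Node], rng: random.Random) -> bool:
    """Replace one randomly chosen leaf of `tree` with a copy of a stored module."""
    slots, stack = [], [tree]
    while stack:
        node = stack.pop()
        if node.frozen:
            continue
        for i, child in enumerate(node.children):
            if not child.children:
                slots.append((node, i))          # leaf position that can host a module
            elif not child.frozen:
                stack.append(child)
    if not slots or not library:
        return False
    parent, i = rng.choice(slots)
    parent.children[i] = copy.deepcopy(rng.choice(library))
    return True
```

In this sketch a frozen subtree is never searched again by `internal_nodes` or `reuse`, so later operators would treat it as a single unit. That is one plausible reading of "encapsulated"; the paper itself may protect or share subtrees differently.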
