首页> 外文期刊>Neurocomputing >A niching genetic programming-based multi-objective algorithm for hybrid data classification
【24h】

A niching genetic programming-based multi-objective algorithm for hybrid data classification

机译:基于小生境遗传规划的多目标混合数据分类算法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper introduces a multi-objective algorithm based on genetic programming to extract classification rules in databases composed of hybrid data, i.e., regular (e.g. numerical, logical, and textual) and non-regular (e.g. geographical) attributes. This algorithm employs a niche technique combined with a population archive in order to identify the rules that are more suitable for classifying items amongst classes of a given data set. The algorithm is implemented in such a way that the user can choose the function set that is more adequate for a given application. This feature makes the proposed approach virtually applicable to any kind of data set classification problem. Besides, the classification problem is modeled as a multi-objective one, in which the maximization of the accuracy and the minimization of the classifier complexity are considered as the objective functions. A set of different classification problems, with considerably different data sets and domains, has been considered: wines, patients with hepatitis, incipient faults in power transformers and level of development of cities. In this last data set, some of the attributes are geographical, and they are expressed as points, lines or polygons. The effectiveness of the algorithm has been compared with three other methods, widely employed for classification: Decision Tree (C4.5), Support Vector Machine (SVM) and Radial Basis Function (RBF). Statistical comparisons have been conducted employing one-way ANOVA and Tukey's tests, in order to provide reliable comparison of the methods. The results show that the proposed algorithm achieved better classification effectiveness in all tested instances, what suggests that it is suitable for a considerable range of classification applications.
机译:本文介绍了一种基于遗传编程的多目标算法,以从混合数据组成的数据库中提取分类规则,即常规数据(例如数字,逻辑和文本)和非常规数据(例如地理)。该算法采用利基技术与人口档案相结合,以识别更适合在给定数据集的类别中对项目进行分类的规则。该算法的实现方式是,用户可以选择对给定应用程序更合适的功能集。此功能使所提出的方法实际上适用于任何种类的数据集分类问题。此外,将分类问题建模为一个多目标模型,其中将准确性的最大化和分类器复杂度的最小化作为目标函数。已经考虑了一系列具有不同数据集和领域的不同分类问题:葡萄酒,肝炎患者,电力变压器的初期故障和城市发展水平。在最后一个数据集中,某些属性是地理属性,它们表示为点,线或多边形。该算法的有效性已与其他三种广泛用于分类的方法进行了比较:决策树(C4.5),支持向量机(SVM)和径向基函数(RBF)。为了提供方法的可靠比较,已经使用单向方差分析和Tukey检验进行了统计比较。结果表明,该算法在所有测试实例中均具有较好的分类效果,这表明该算法适用于相当大范围的分类应用。

著录项

  • 来源
    《Neurocomputing》 |2014年第10期|342-357|共16页
  • 作者单位

    Departamento de Computacao (DECOM/CEFET-MG), CEFET-MG Av. Amazonas, 7675, 30510-000 Belo Horizonte, MG, Brasil,Electrical Engineering Graduate Program (PPGEE/UFMG), UFMG Av. Antonio Carlos, 6627, 31270-010 Belo Horizonte, MG, Brasil;

    Departamento de Ciencia da Computacao (DCC/UFMG), UFMG Av. Antonio Carlos, 6627, 31270-010 Belo Horizonte, MG, Brasil;

    Departamento de Engenharia Eletrica (DEE/UFMG), UFMG Av. Antonio Carlos. 6627, 31270-010 Belo Horizonte, MG, Brasil;

    Departamento de Engenharia Eletrica (DEE/UFMG), UFMG Av. Antonio Carlos. 6627, 31270-010 Belo Horizonte, MG, Brasil,Evolutionary Computing Laboratory (LCE/UFMG), UFMG Av. Antonio Carlos, 6627, 31270-010 Belo Horizonte, MG, Brasil;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Classification rules; Spatial data mining; Genetic programming; Multi-objective algorithm;

    机译:分类规则;空间数据挖掘;基因编程;多目标算法;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号