首页> 美国卫生研究院文献>Entropy >Multi-Objective Evolutionary Rule-Based Classification with Categorical Data
【2h】

Multi-Objective Evolutionary Rule-Based Classification with Categorical Data

机译:基于多目标进化规则的基于范畴分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The ease of interpretation of a classification model is essential for the task of validating it. Sometimes it is required to clearly explain the classification process of a model’s predictions. Models which are inherently easier to interpret can be effortlessly related to the context of the problem, and their predictions can be, if necessary, ethically and legally evaluated. In this paper, we propose a novel method to generate rule-based classifiers from categorical data that can be readily interpreted. Classifiers are generated using a multi-objective optimization approach focusing on two main objectives: maximizing the performance of the learned classifier and minimizing its number of rules. The multi-objective evolutionary algorithms ENORA and NSGA-II have been adapted to optimize the performance of the classifier based on three different machine learning metrics: accuracy, area under the ROC curve, and root mean square error. We have extensively compared the generated classifiers using our proposed method with classifiers generated using classical methods such as PART, JRip, OneR and ZeroR. The experiments have been conducted in full training mode, in 10-fold cross-validation mode, and in train/test splitting mode. To make results reproducible, we have used the well-known and publicly available datasets Breast Cancer, Monk’s Problem 2, Tic-Tac-Toe-Endgame, Car, kr-vs-kp and Nursery. After performing an exhaustive statistical test on our results, we conclude that the proposed method is able to generate highly accurate and easy to interpret classification models.
机译:对分类模型的易于解释对于验证它的任务至关重要。有时需要清楚地解释模型预测的分类过程。与问题的背景上毫不费力地毫不费力地毫不费力地毫不费力地毫不费力地逐步地相关的模型可以是必要的道德和合法评估的。在本文中,我们提出了一种新的方法来从可以容易解释的分类数据生成基于规则的分类器。使用多目标优化方法生成分类器,专注于两个主要目标:最大化学习分类器的性能,并最大限度地减少其规则数。多目标进化算法ENORA和NSGA-II已经适用于基于三种不同的机器学习指标优化分类器的性能:ROC曲线下的精度,区域和均方根误差。我们使用我们的建议方法进行了广泛的比较了生成的分类器,其中使用经典方法生成的分类器,例如零件,Jrip,Oner和Zeror。实验已经以10倍交叉验证模式,以及火车/测试分裂模式进行全面训练模式进行。为了使结果可重复,我们使用了众所周知和公开的数据集乳腺癌,僧侣的问题2,TIC-TAC-ENDAME,汽车,KR-VS-KP和幼儿园。在对我们的结果进行详尽的统计测试后,我们得出结论,该方法能够产生高精度且易于解释的分类模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号