首页> 外文期刊>Applied computational intelligence and soft computing >A Comparison Study on Rule Extraction from Neural Network Ensembles, Boosted Shallow Trees, and SVMs
【24h】

A Comparison Study on Rule Extraction from Neural Network Ensembles, Boosted Shallow Trees, and SVMs

机译:神经网络集成,增强浅树和支持向量机的规则提取比较研究

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

One way to make the knowledge stored in an artificial neural network more intelligible is to extract symbolic rules. However, producing rules from Multilayer Perceptrons (MLPs) is an NP-hard problem. Many techniques have been introduced to generate rules from single neural networks, but very few were proposed for ensembles. Moreover, experiments were rarely assessed by 10-fold cross-validation trials. In this work, based on the Discretized Interpretable Multilayer Perceptron (DIMLP), experiments were performed on 10 repetitions of stratified 10-fold cross-validation trials over 25 binary classification problems. The DIMLP architecture allowed us to produce rules from DIMLP ensembles, boosted shallow trees (BSTs), and Support Vector Machines (SVM). The complexity of rulesets was measured with the average number of generated rules and average number of antecedents per rule. From the 25 used classification problems, the most complex rulesets were generated from BSTs trained by "gentle boosting" and "real boosting." Moreover, we clearly observed that the less complex the rules were, the better their fidelity was. In fact, rules generated from decision stumps trained by modest boosting were, for almost all the 25 datasets, the simplest with the highest fidelity. Finally, in terms of average predictive accuracy and average ruleset complexity, the comparison of some of our results to those reported in the literature proved to be competitive.
机译:使人工神经网络中存储的知识更易于理解的一种方法是提取符号规则。但是,从多层感知器(MLP)生成规则是NP难题。已经引入了许多技术来从单个神经网络生成规则,但是很少有人提出用于集成的规则。此外,很少通过10倍交叉验证试验来评估实验。在这项工作中,基于离散可解释的多层感知器(DIMLP),针对25个二元分类问题对10次分层10倍交叉验证试验的重复进行了实验。 DIMLP体系结构使我们能够从DIMLP集成,增强浅树(BST)和支持向量机(SVM)生成规则。规则集的复杂性通过生成的规则的平均数量和每个规则的先行数量来衡量。从使用的25个分类问题中,最复杂的规则集是由经过“温和增强”和“实际增强”训练的BST生成的。此外,我们清楚地观察到,规则越复杂,其保真度就越好。实际上,在几乎所有25个数据集中,通过适度增强训练的决策树桩生成的规则都是最简单,保真度最高的数据集。最后,就平均预测准确性和平均规则集复杂度而言,将我们的某些结果与文献报道的结果进行比较是具有竞争力的。

著录项

  • 来源
    《Applied computational intelligence and soft computing》 |2018年第2018期|4084850.1-4084850.20|共20页
  • 作者

    Guido Bologna; Yoichi Hayashi;

  • 作者单位

    Department of Computer Science, University of Applied Sciences and Arts Western Switzerland, Rue de la Prairie 4,1202 Geneva, Switzerland,Department of Computer Science, University of Geneva, Route de Drize 7, 1227 Carouge, Switzerland;

    Department of Computer Science, Meiji University, Tama-ku, Kawasaki, Kanagawa 214-8571, Japan;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号