On Taxonomy and Evaluation of Feature Selection-Based Learning Classifier System Ensemble Approaches for Data Mining Problems

Debie Essam; Shafi Kamran; Merrick Kathryn; Lokan Chris

Abstract

Ensemble methods combine multiple learning machines to improve performance on a learning task in terms of prediction accuracy, scalability, and other measures. These methods have been applied to evolutionary machine learning techniques, including learning classifier systems (LCSs). In this article, we first propose a conceptual framework that allows ensemble-based methods to be categorized appropriately for fair comparison and that highlights gaps in the corresponding literature. The framework is generic and consists of three sequential stages: a pre-gate stage concerned with data preparation; a member stage that accounts for the types of learning machines used to build the ensemble; and a post-gate stage concerned with the methods used to combine the ensemble output. A taxonomy of LCS-based ensembles is then presented using this framework. The article then focuses on comparing LCS ensembles that use feature selection in the pre-gate stage. An evaluation methodology is proposed to systematically analyze the performance of these methods. Specifically, LCS ensemble methods based on random feature sampling and on rough set feature selection are compared. Experimental results show that the rough set-based approach achieves significantly better classification accuracy than the random subspace method on problems with high numbers of irrelevant features. The performance of the two approaches is comparable on problems with high numbers of redundant features.
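
The three-stage framework described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function and class names are hypothetical, the member learner is a simple nearest-centroid classifier standing in for an LCS, the pre-gate stage shows only random feature sampling (rough set feature selection is not shown), and majority voting is assumed as one possible post-gate combiner.

```python
import random
from collections import Counter


def pre_gate_random_subspace(n_features, subset_size, n_members, rng):
    """Pre-gate stage: draw one random feature subset per ensemble member."""
    return [sorted(rng.sample(range(n_features), subset_size))
            for _ in range(n_members)]


def project(row, features):
    """Keep only the selected feature columns of a single row."""
    return [row[j] for j in features]


class NearestCentroid:
    """Member stage stand-in (NOT an LCS): predicts the nearest class centroid."""

    def fit(self, X, y):
        counts = Counter(y)
        sums = {}
        for row, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(row))
            for j, value in enumerate(row):
                acc[j] += value
        self.centroids = {c: [s / counts[c] for s in acc]
                          for c, acc in sums.items()}
        return self

    def predict_one(self, row):
        def squared_distance(label):
            return sum((a - b) ** 2
                       for a, b in zip(row, self.centroids[label]))
        # Sorting the labels makes tie-breaking deterministic.
        return min(sorted(self.centroids), key=squared_distance)


def train_ensemble(X, y, subsets):
    """Member stage: train one learner per feature subset."""
    return [(features,
             NearestCentroid().fit([project(r, features) for r in X], y))
            for features in subsets]


def post_gate_majority_vote(ensemble, row):
    """Post-gate stage: combine member outputs by majority vote."""
    votes = Counter(member.predict_one(project(row, features))
                    for features, member in ensemble)
    top = max(votes.values())
    # Ties are broken by the smallest label (an arbitrary choice).
    return min(label for label, n in votes.items() if n == top)


if __name__ == "__main__":
    # Toy data (made up for illustration): every feature separates the classes.
    X = [[0.0, 0.1, 0.0, 0.2], [0.1, 0.0, 0.2, 0.1],
         [1.0, 0.9, 1.0, 0.8], [0.9, 1.0, 0.8, 1.0]]
    y = [0, 0, 1, 1]
    subsets = pre_gate_random_subspace(4, 2, 5, random.Random(0))
    ensemble = train_ensemble(X, y, subsets)
    print(post_gate_majority_vote(ensemble, [0.05, 0.05, 0.1, 0.1]))
    print(post_gate_majority_vote(ensemble, [0.95, 0.9, 0.9, 0.95]))
```

Swapping the pre-gate function for a different feature selector, or the vote for another combiner, leaves the other two stages unchanged, which is the kind of like-for-like comparison the framework is meant to support.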

Bibliographic details

Source
Computational Intelligence, 2017, No. 3, pp. 554-578 (25 pages)
Author affiliations

Zagazig Univ, Fac Comp & Informat, Zagazig, Egypt;

UNSW Canberra, Sch Engn & Informat Technol, Canberra, ACT 2600, Australia;

UNSW Canberra, Sch Engn & Informat Technol, Canberra, ACT 2600, Australia;

UNSW Canberra, Sch Engn & Informat Technol, Canberra, ACT 2600, Australia;

Original format: PDF
Language: English
Keywords
ensemble learning; feature selection; rough set theory; learning classifier systems;
