首页> 外文会议>European conference on applications of evolutionary computation >Bagging and Feature Selection for Classification with Incomplete Data
【24h】

Bagging and Feature Selection for Classification with Incomplete Data

机译:不完整数据分类的装袋和特征选择

获取原文

摘要

Missing values are an unavoidable issue of many real-world datasets. Dealing with missing values is an essential requirement in classification problem, because inadequate treatment with missing values often leads to large classification errors. Some classifiers can directly work with incomplete data, but they often result in big classification errors and generate complex models. Feature selection and bagging have been successfully used to improve classification, but they are mainly applied to complete data. This paper proposes a combination of bagging and feature selection to improve classification with incomplete data. To achieve this purpose, a wrapper-based feature selection which can directly work with incomplete data is used to select suitable feature subsets for bagging. The experiments on eight incomplete datasets were designed to compare the proposed method with three other popular methods that are able to deal with incomplete data using C4.5/REPTree as classifiers and using Particle Swam Optimisation as a search technique in feature selection. Results show that the combination of bagging and feature selection can not only achieve better classification accuracy than the other methods but also generate less complex models compared to the bagging method.
机译:缺少值是许多现实世界数据集不可避免的问题。处理缺失值是分类问题中的基本要求,因为对缺失值的不充分处理通常会导致较大的分类错误。一些分类器可以直接处理不完整的数据,但是它们通常会导致较大的分类错误并生成复杂的模型。特征选择和装袋已成功用于改善分类,但它们主要应用于完整数据。本文提出了将装袋和特征选择相结合的方法,以改进不完整数据的分类。为了达到这个目的,可以直接使用不完整数据的基于包装的特征选择来选择合适的特征子集进行装袋。设计了八个不完整数据集上的实验,以将所提出的方法与其他三个可以使用C4.5 / REPTree作为分类器,并使用粒子游动优化作为特征选择的搜索技术来处理不完整数据的流行方法进行比较。结果表明,与套袋方法相比,套袋和特征选择的结合不仅可以实现更好的分类精度,而且还可以生成更简单的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号