首页> 外文会议>International Workshop on Multiple Classifier Systems(MCS 2007); 20070523-25; Prague(CZ) >Random Feature Subset Selection for Ensemble Based Classification of Data with Missing Features
【24h】

Random Feature Subset Selection for Ensemble Based Classification of Data with Missing Features

机译:基于集成的缺失特征数据分类的随机特征子集选择

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

We report on our recent progress in developing an ensemble of classifiers based algorithm for addressing the missing feature problem. Inspired in part by the random subspace method, and in part by an AdaBoost type distribution update rule for creating a sequence of classifiers, the proposed algorithm generates an ensemble of classifiers, each trained on a different subset of the available features. Then, an instance with missing features is classified using only those classifiers whose training dataset did not include the currently missing features. Within this framework, we experiment with several bootstrap sampling strategies each using a slightly different distribution update rule. We also analyze the effect of the algorithm's primary free parameter (the number of features used to train each classifier) on its performance. We show that the algorithm is able to accommodate data with up to 30% missing features, with little or no significant performance drop.
机译:我们报告了我们在开发基于分类器的整体算法以解决缺失特征问题方面的最新进展。所提出的算法部分受到随机子空间方法的启发,部分受到用于创建分类器序列的AdaBoost类型分布更新规则的启发,生成了一组分类器,每个分类器都在可用功能的不同子集上进行训练。然后,仅使用那些训练数据集不包括当前缺失特征的分类器对具有缺失特征的实例进行分类。在此框架内,我们尝试使用几种稍有不同的分布更新规则的自举抽样策略。我们还分析了算法的主要自由参数(用于训练每个分类器的特征数量)对其性能的影响。我们证明了该算法能够容纳多达30%缺失特征的数据,而性能下降很少或没有。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号