Conference: International Conference on Mining Intelligence and Knowledge Exploration

Sequential Instance Based Feature Subset Selection for High Dimensional Data


Abstract

Feature subset selection is a key problem in data-mining classification tasks: it helps to obtain more compact and understandable models without degrading their performance. This paper deals with supervised wrapper-based feature subset selection in data sets with a very large number of attributes and a low sample size. In this setting, standard wrapper algorithms cannot be applied because of their computational complexity. We propose a new hybrid filter-wrapper approach based on instance learning, whose main goal is to accelerate feature subset selection by reducing the number of wrapper evaluations. In our method, named Hybrid Instance Based Sequential Backward Search (HIB-SBS), instance learning is used to weight features and generate candidate feature subsets; SBS and K-nearest neighbours (KNN) then form the wrapper evaluation stage. The method is experimentally tested and compared with state-of-the-art algorithms on four high-dimensional, low-sample-size datasets. The results show an impressive reduction in execution time compared to the wrapper approach, and our proposal outperforms the other methods in terms of accuracy and cardinality of the selected subset.
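The two-stage idea in the abstract (an instance-based filter that weights features, followed by a sequential backward search scored by KNN) can be illustrated with a minimal sketch. This is not the authors' HIB-SBS implementation: the abstract does not specify the weighting scheme, how candidate subsets are generated, or the stopping rule. The sketch assumes a Relief-style weight as the instance-based filter, a single candidate set made of the top-ranked features, leave-one-out KNN accuracy as the wrapper score, and a stop-on-first-accuracy-drop rule. All function names and parameters (`relief_weights`, `loo_knn_accuracy`, `hybrid_sbs`, `n_candidates`, `k`) are hypothetical.

```python
import numpy as np

def relief_weights(X, y):
    """Relief-style feature weights (assumed stand-in for the instance-learning filter).
    Assumes every class has at least two samples."""
    n, d = X.shape
    span = X.max(axis=0) - X.min(axis=0)
    span[span == 0] = 1.0                      # avoid division by zero on constant features
    Xs = (X - X.min(axis=0)) / span            # scale each feature to [0, 1]
    w = np.zeros(d)
    for i in range(n):
        diff = np.abs(Xs - Xs[i])              # per-feature distance to every instance
        dist = diff.sum(axis=1)                # Manhattan distance
        dist[i] = np.inf                       # exclude the instance itself
        same = (y == y[i])
        hit = np.argmin(np.where(same, dist, np.inf))    # nearest same-class instance
        miss = np.argmin(np.where(~same, dist, np.inf))  # nearest other-class instance
        w += diff[miss] - diff[hit]            # reward features that separate the classes
    return w / n

def loo_knn_accuracy(X, y, k=3):
    """Leave-one-out accuracy of a majority-vote k-NN classifier (Euclidean distance)."""
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d2, np.inf)               # an instance may not vote for itself
    nn = np.argsort(d2, axis=1)[:, :k]
    correct = 0
    for i in range(len(y)):
        labels, counts = np.unique(y[nn[i]], return_counts=True)
        correct += int(labels[np.argmax(counts)] == y[i])
    return correct / len(y)

def hybrid_sbs(X, y, n_candidates=10, k=3):
    """Filter stage keeps the top-weighted features; wrapper stage runs backward search."""
    w = relief_weights(X, y)
    selected = [int(f) for f in np.argsort(w)[::-1][:n_candidates]]
    best = loo_knn_accuracy(X[:, selected], y, k)
    while len(selected) > 1:
        # Score every subset obtained by dropping exactly one feature.
        scores = [loo_knn_accuracy(X[:, [f for f in selected if f != g]], y, k)
                  for g in selected]
        j = int(np.argmax(scores))
        if scores[j] < best:                   # assumed rule: stop once accuracy would drop
            break
        best = scores[j]
        selected.pop(j)
    return sorted(selected), best

if __name__ == "__main__":
    # Synthetic high-dimensional, low-sample-size data: 40 samples, 200 features,
    # of which only feature 0 carries class information.
    rng = np.random.default_rng(0)
    y = np.arange(40) % 2
    X = rng.normal(size=(40, 200))
    X[:, 0] += 10 * y
    subset, acc = hybrid_sbs(X, y)
    print("selected features:", subset, "LOO accuracy:", acc)
```

In this sketch the wrapper only ever evaluates subsets of the `n_candidates` features kept by the filter, so the number of KNN evaluations depends on `n_candidates` rather than on the full dimensionality. That is the kind of saving the abstract attributes to the hybrid approach, though the paper's actual mechanism may differ.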
