【24h】

Hypothesis-margin model incorporating structure information for feature selection

机译:包含结构信息以进行特征选择的假设边界模型

获取原文

摘要

Iterative search margin based algorithm (Simba) has been proven effective for feature selection. However, the previously proposed model does not effectively utilize the structure information hidden in data which may have a great impact on the generalization performance of post-analysis classifiers. In this paper, we introduce a novel hypothesis-margin model incorporating structure information for feature seection(Ssimba_FS). In the newly developed model, the structure information induced by clustering algorithms is incorporated into the existing hypothesis margin model for feature selection, and meanwhile the contribution of the structure information can be effectively adjusted by a trade-off parameter. Based on Ssintba_FS, we present a novel algorithm for feature selection(Ssimba). By Ssimba, an effectively ranked feature list can be obtained, futher a compact and relevant feature subset can be directly generated from the ranked feature list. The experiments on 6 real-life benchmark datasets show that the classifiers induced by the algorithm of this paper has better or comparable classification performance than those established by Simba in most cases.
机译:基于迭代搜索余量的算法(Simba)已被证明对特征选择有效。然而,先前提出的模型不能有效利用隐藏在数据中的结构信息,这可能对分析后分类器的泛化性能产生很大影响。在本文中,我们介绍了一种新的假设-边际模型,该模型结合了用于特征选择的结构信息(Ssimba_FS)。在新开发的模型中,将通过聚类算法得出的结构信息合并到现有的假设裕度模型中进行特征选择,同时可以通过权衡参数有效地调整结构信息的贡献。基于Ssintba_FS,我们提出了一种新的特征选择算法(Ssimba)。通过Ssimba,可以获得有效排序的特征列表,并且进一步可以从排序的特征列表直接生成紧凑且相关的特征子集。在6个现实生活中的基准数据集上进行的实验表明,在大多数情况下,与Simba建立的分类器相比,本文算法产生的分类器具有更好或相当的分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号