首页> 外文期刊>Information Fusion >On developing an automatic threshold applied to feature selection ensembles
【24h】

On developing an automatic threshold applied to feature selection ensembles

机译:开发应用于特征选择集合的自动阈值

获取原文
获取原文并翻译 | 示例
           

摘要

Feature selection ensemble methods are a recent approach aiming at adding diversity in sets of selected features, improving performance and obtaining more robust and stable results. However, using an ensemble introduces the need for an aggregation step to combine all the output methods that confirm the ensemble. Besides, when trying to improve computational efficiency, ranking methods that order all initial features are preferred, and so an additional thresholding step is also mandatory. In this work two different ensemble designs based on ranking methods are described. The main difference between them is the order in which the combination and thresholding steps are performed. In addition, a new automatic threshold based on the combination of three data complexity measures is proposed and compared with traditional thresholding approaches based on retaining a fixed percentage of features. The behavior of these methods was tested, according to the SVM classification accuracy, with satisfactory results, for three different scenarios: synthetic datasets and two types of real datasets (where sample size is much higher than feature size, and where feature size is much higher than sample size).
机译:特征选择集合方法是最近的一种方法,旨在在所选功能集中添加多样性,提高性能并获得更强大和稳定的结果。但是,使用合奏介绍了对聚合步骤的需求,以组合确认合奏的所有输出方法。此外,在尝试提高计算效率时,排名方法是优选所有初始特征的排序方法,因此也是强制性的阈值步骤。在这项工作中,描述了基于排名方法的两种不同的集合设计。它们之间的主要区别是执行组合和阈值处理步骤的顺序。另外,提出了一种基于三个数据复杂度措施的组合的新的自动阈值,并基于保持固定的特征百分比与传统的阈值方法进行比较。根据SVM分类准确性,测试了这些方法的行为,具有令人满意的结果,对于三种不同的方案:合成数据集和两种类型的实时数据集(其中样本大小远高于特征大小,其中特征尺寸高得多比样本大小)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号