首页> 外文期刊>International Journal of Business Intelligence and Data Mining >Measuring stability of feature ranking techniques:a noise-based approach
【24h】

Measuring stability of feature ranking techniques:a noise-based approach

机译:测量特征排名技术的稳定性:基于噪声的方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

One very common criterion used to evaluate feature selection methods is the performance of a chosen classifier trained with the selected features. Another important evaluation criterion that has, until recently, been neglected is the stability of these feature selection methods. While other studies have shown interest in measuring the degree of agreement between the outputs of a technique trained on randomly selected subsets from the same input data, this study presents the importance of evaluating stability in the presence of noise. Experiments are conducted with 17 filters (six standard filter-based ranking techniques and 11 threshold-based feature selection techniques) on nine different real-world datasets. This paper identifies the techniques that are inherently more sensitive to class noise and demonstrates how certain characteristics (sample size and class imbalance) of the data can affect the stability performance of some feature selection methods.
机译:用于评估特征选择方法的一个非常常见的标准是使用选定特征训练的选定分类器的性能。直到最近被忽略的另一个重要评估标准是这些特征选择方法的稳定性。尽管其他研究表明有兴趣测量在相同输入数据中随机选择的子集上训练的技术的输出之间的一致性程度,但这项研究提出了评估存在噪声时稳定性的重要性。在9个不同的实际数据集上使用17个过滤器(六种基于标准过滤器的排名技术和11种基于阈值的特征选择技术)进行了实验。本文确定了本质上对类噪声更敏感的技术,并演示了数据的某些特征(样本大小和类不平衡)如何影响某些特征选择方法的稳定性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号