...
【24h】

A framework for cost-based feature selection

机译:基于成本的功能选择框架

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Over the last few years,the dimensionality of datasets involved in data mining applications has increased dramatically. In this situation, feature selection becomes indispensable as it allows for dimensionality reduction and relevance detection. The research proposed in this paper broadens the scope of feature selection by taking into consideration not only the relevance of the features but also their associated costs. A new general framework is proposed, which consists of adding a new term to the evaluation function of a filter feature selection method so that the cost is taken into account. Although the proposed methodology could be applied to any feature selection filter, in this paper the approach is applied to two representative filter methods: Correlation-based Feature Selection (CFS) and Minimal-Redundancy- Maximal-Relevance (mRMR), as an example of use. The behavior of the proposed framework is tested on 17 heterogeneous classification datasets, employing a Support Vector Machine (SVM) as a classifier. The results of the experimental study show that the approach is sound and that it allows the user to reduce the cost without compromising the classification error.
机译:在过去的几年中,数据挖掘应用程序中涉及的数据集的维数急剧增加。在这种情况下,特征选择变得必不可少,因为它允许降维和相关性检测。本文提出的研究不仅考虑了特征的相关性,还考虑了相关的成本,从而拓宽了特征选择的范围。提出了一种新的通用框架,该框架包括在过滤器特征选择方法的评估函数中添加新的术语,从而将成本考虑在内。尽管所提出的方法可以应用于任何特征选择过滤器,但在本文中,该方法已应用于两种代表性的过滤器方法:基于相关性的特征选择(CFS)和最小冗余-最大相关性(mRMR),例如用。使用支持向量机(SVM)作为分类器,在17个异构分类数据集上测试了所提出框架的行为。实验研究的结果表明,该方法是合理的,它可以使用户降低成本而不损害分类错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号