...
首页> 外文期刊>Neurocomputing >Feature selection based on measurement of ability to classify subproblems
【24h】

Feature selection based on measurement of ability to classify subproblems

机译:基于度量子问题分类能力的特征选择

获取原文
获取原文并翻译 | 示例
           

摘要

Feature selection is important and necessary especially for processing large scale data. Existing feature selection methods generally compute a discriminant value with respect to class variable for a feature to indicate its classification ability. Such a scalar value can hardly reveal the multi-faceted classification abilities of a feature for the different subproblems in a classification task. In this paper, an effective way is proposed for feature selection based on measurement of ability to classify subproblems and discrimination structure complementarity of features. The classification abilities of a feature for different subproblems are calculated respectively. Hence for the feature, a discrimination structure vector representing its classification abilities for all subproblems can be obtained. In feature selection, the features, which can individually classify as many subproblems as possible, are firstly evaluated and selected. Subsequently, their complementary features are selectively chosen, which can complementarily classify the subproblems that the selected features cannot classify. Two algorithms are designed for progressively selecting features, by firstly eliminating irrelevant features and then abandoning redundant features based on discrimination structure complementarity. The proposed algorithms are compared with some related methods for feature selection on some open gene expression datasets and UCI datasets. Experimental results demonstrate the effectiveness of the proposed method.
机译:特征选择非常重要且必要,特别是在处理大规模数据时。现有的特征选择方法通常针对特征的类别变量计算判别值以指示特征的分类能力。这样的标量值几乎无法揭示分类任务中不同子问题的要素的多方面分类能力。本文提出了一种基于子问题分类能力的度量和特征的判别结构互补性的特征选择方法。分别计算特征针对不同子问题的分类能力。因此,对于该特征,可以获得代表其对所有子问题的分类能力的判别结构向量。在特征选择中,首先评估和选择可以单独分类尽可能多的子问题的特征。随后,有选择地选择它们的互补特征,这可以互补地分类所选特征不能分类的子问题。设计了两种算法来逐步选择特征,方法是首先消除不相关的特征,然后基于判别结构的互补性放弃冗余特征。将所提出的算法与一些相关方法进行比较,以在一些开放基因表达数据集和UCI数据集上进行特征选择。实验结果证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号