...
首页> 外文期刊>Computational Intelligence >A NEW SUPERVISED FEATURE SELECTION METHOD FOR PATTERN CLASSIFICATION
【24h】

A NEW SUPERVISED FEATURE SELECTION METHOD FOR PATTERN CLASSIFICATION

机译:一种新的模式分类特征选择方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

With the rapid development of information techniques, the dimensionality of data in many application domains, such as text categorization and bioinformatics, is getting higher and higher. The high-dimensionality data may bring many adverse situations, such as overfitting, poor performance, and low efficiency, to traditional learning algorithms in pattern classification. Feature selection aims at reducing the dimensionality of data and providing discriminative features for pattern learning algorithms. Due to its effectiveness, feature selection is now gaining increasing attentions from a variety of disciplines and currently many efforts have been attempted in this field. In this paper, we propose a new supervised feature selection method to pick important features by using information criteria. Unlike other selection methods, the main characteristic of our method is that it not only takes both maximal relevance to the class labels and minimal redundancy to the selected features into account, but also works like feature clustering in an agglomerative way. To measure the relevance and redundancy of feature exactly, two different information criteria, i.e., mutual information and coefficient of relevance, have been adopted in our method. The performance evaluations on 12 benchmark data sets show that the proposed method can achieve better performance than other popular feature selection methods in most cases.
机译:随着信息技术的飞速发展,诸如文本分类和生物信息学等许多应用领域中的数据维度越来越高。高维数据可能给模式分类中的传统学习算法带来许多不利情况,例如过拟合,性能差和效率低下。特征选择旨在减少数据的维数并为模式学习算法提供判别性特征。由于其有效性,特征选择现在正受到各种学科的越来越多的关注,并且目前在该领域已进行了许多努力。在本文中,我们提出了一种新的有监督特征选择方法,该方法通过使用信息准则来选择重要特征。与其他选择方法不同,我们的方法的主要特点是它不仅将与类标签的最大相关性和对所选特征的最小冗余都考虑在内,而且还像聚类中的特征聚类一样工作。为了准确地测量特征的相关性和冗余性,我们的方法采用了两种不同的信息标准,即互信息和相关系数。对12个基准数据集的性能评估表明,在大多数情况下,该方法可以比其他流行的特征选择方法获得更好的性能。

著录项

  • 来源
    《Computational Intelligence》 |2014年第2期|342-361|共20页
  • 作者单位

    Department of Computer Science, Zhejiang Normal University, Jinhua, China ,National Center for Mathematics and Interdisciplinary Sciences, CAS, Beijing, China;

    Department of Computer Science, University of Vermont, Burlington, Vermont, USA ,College of Computer Science and Information Engineering, Hefei University of Technology, Heifei, China ,Department of Computer Science, University of Vermont, 33 Colchester Avenue, Burlington, VT 05405;

    Department of Computer Science, Guangxi Normal University, Guilin, China ,University of Technology, Sydney, Australia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    feature selection; dimensionality reduction; information entropy; pattern classification; clustering;

    机译:特征选择;降维;信息熵模式分类聚类;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号