首页> 外文期刊>Neural computing & applications >Efficient feature selection and classification algorithm based on PSO and rough sets
【24h】

Efficient feature selection and classification algorithm based on PSO and rough sets

机译:基于PSO和粗糙集的高效特征选择和分类算法

获取原文
获取原文并翻译 | 示例
       

摘要

The high-dimensional data are often characterized by more number of features with less number of instances. Many of the features are irrelevant and redundant. These features may be especially harmful in case of extreme number of features carries the problem of memory usage in order to represent the datasets. On the other hand relatively small training set, where this irrelevancy and redundancy makes harder to evaluate. Hence, in this paper we propose an efficient feature selection and classification method based on Particle Swarm Optimization (PSO) and rough sets. In this study, we propose the inconsistency handler algorithm for handling inconsistency in dataset, new quick reduct algorithm for handling irrelevant/noisy features and fitness function with three parameters, the classification quality of feature subset, remaining features and the accuracy of approximation. The proposed method is compared with two traditional and three fusion of PSO and rough set-based feature selection methods. In this study, Decision Tree and Naive Bayes classifiers are used to calculate the classification accuracy of the selected feature subset on nine benchmark datasets. The result shows that the proposed method can automatically selects small feature subset with better classification accuracy than using all features. The proposed method also outperforms the two traditional and three existing PSO and rough set-based feature selection methods in terms of the classification accuracy, cardinality of feature and stability indices. It is also observed that with increased weight on the classification quality of feature subset of the fitness function, there is a significant reduction in the cardinality of features and also achieve better classification accuracy as well.
机译:高维数据的特征在于具有较少数量的实例的特征的特征。许多功能都是无关紧要的。在极端特征的情况下,这些特征可能特别有害,以便表示数据集。另一方面,相对小的训练集,这种无关紧要和冗余使得更难评估。因此,在本文中,我们提出了一种基于粒子群优化(PSO)和粗糙集的有效特征选择和分类方法。在本研究中,我们提出了用于处理数据集中的不一致处理程序算法,用于处理无关/噪声功能的新的快速减减算法和具有三个参数的健身功能,特征子集的分类质量,剩余特征和近似精度。该方法与PSO和粗糙集的特征选择方法的两个传统和三种融合进行了比较。在本研究中,决策树和朴素贝叶斯分类器用于计算九个基准数据集上所选功能子集的分类准确性。结果表明,所提出的方法可以自动选择具有比使用所有功能更好的分类精度的小功能子集。该方法还优于两种传统和三个现有的PSO和粗糙集基的特征选择方法,以分类准确性,特征和稳定性指标的基数。还观察到,随着体重函数的特征子集的分类质量的重量增加,特征的基数显着降低,并且也实现了更好的分类精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号