首页> 外文期刊>Indian Journal of Science and Technology >A Novel Clustering based Feature Subset Selection Framework for Effective Data Classification
【24h】

A Novel Clustering based Feature Subset Selection Framework for Effective Data Classification

机译:一种新型的基于聚类的有效数据分类特征子集选择框架

获取原文
           

摘要

Background/Objectives: A novel feature selection framework using minimum variance method is proposed. The purpose of the proposed method is to reduce the computational complexity, reduce the number of initial features and increase the classification accuracy of the selected feature subsets. Methods/Statistical Analysis: The clusters are formed using minimum variance method. The process must be repeated for different pairs of records and voting is done on the different sets of cluster pairs. The cluster pair which has the maximum number of votes is chosen and the highest priority member is chosen from each cluster using information gain and removing the remaining attributes, thus reducing the number of attributes. Findings: The proposed feature selector is evaluated by comparing it with existing feature selection algorithms over 9 datasets from UCI and WebKb Datasets. The proposed method shows better results in terms of number of selected features, classification accuracy, and running time than most existing algorithms. Improvements/Applications: A new feature selector using minimum variance method is implemented and found that it performs better than the popular and computationally expensive traditional algorithms.
机译:背景/目的:提出了一种使用最小方差法的新颖特征选择框架。提出的方法的目的是减少计算复杂度,减少初始特征的数量并提高所选特征子集的分类精度。方法/统计分析:使用最小方差方法形成聚类。必须对不同的记录对重复该过程,并对不同的集群对集进行投票。使用信息增益并删除剩余属性,从每个群集中选择具有最大投票数的群集对,并从中选择最高优先级的成员,从而减少了属性的数量。结果:通过与UCI​​和WebKb数据集的9个数据集中的现有特征选择算法进行比较,对提议的特征选择器进行了评估。与大多数现有算法相比,该方法在选定特征的数量,分类准确性和运行时间方面显示出更好的结果。改进/应用:一种使用最小方差方法的新特征选择器得以实现,发现它的性能优于流行且计算量大的传统算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号