首页> 外文期刊>Journal of statistical computation and simulation >A novel feature selection approach based on clustering algorithm
【24h】

A novel feature selection approach based on clustering algorithm

机译:一种基于聚类算法的新颖特征选择方法

获取原文
获取原文并翻译 | 示例

摘要

Clustering is one of the main methods of data mining. K-means algorithm is one of the most common clustering algorithms due to its efficiency and ease of use. In many data mining issues, the dataset contains a large number of fields and, therefore, the identification of the effective fields is an important issue. Appling the proposed algorithm, the important variables of the dataset would be identified. In the proposed method, the dataset is clustered in several stages and in each step the characteristics of the created clusters are examined and the features that transform the structure of clusters are introduced as effective features of the dataset. The proposed method was examined on 4 datasets and the results of this method were compared with other similar work and demonstrated that using this algorithm would eliminate redundant and unrelated features of the dataset and improve classification accuracy.
机译:聚类是数据挖掘的主要方法之一。 K-means算法是由于其效率和易用性的最常见的聚类算法之一。在许多数据挖掘问题中,数据集包含大量字段,因此,有效字段的标识是一个重要问题。应用该算法,将识别数据集的重要变量。在所提出的方法中,数据集在多个阶段中群集,在每个步骤中,检查创建的群集的特性,并将变换集群结构的特征作为数据集的有效特征。在4个数据集上检查了所提出的方法,将该方法的结果与其他类似的工作进行比较,并证明使用该算法将消除数据集的冗余和不相关的特征,并提高分类精度。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号