International Journal of Innovative Research in Science, Engineering and Technology

An Efficient Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data



Abstract

Feature selection is the process of identifying a subset of the most useful features that produces results comparable to those of the original, full set of features. A feature selection algorithm may be evaluated from both the efficiency and the effectiveness points of view: efficiency concerns the time required to find a subset of features, while effectiveness relates to the quality of that subset. Based on these criteria, a Fast clustering-based feature Selection algorithm (FAST) is proposed and experimentally evaluated. The FAST algorithm works in two steps. In the first step, features are divided into clusters using graph-theoretic clustering methods. In the second step, the most representative feature, i.e., the one most strongly related to the target classes, is selected from each cluster to form the final subset of features. Because features in different clusters are relatively independent, the clustering-based strategy of FAST has a high probability of producing a subset of useful and independent features. A Minimum Spanning Tree (MST) built with Prim's algorithm grows only one tree at a time; to ensure the efficiency of FAST, the MST underlying the clustering step is instead constructed with Kruskal's algorithm.
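The two-step procedure can be illustrated with a small sketch. The Python code below is a hypothetical illustration, not the paper's reference implementation: absolute Pearson correlation stands in for the paper's feature-relevance measure, the MST over the feature graph is built with Kruskal's algorithm as the abstract indicates, clusters are formed by cutting MST edges above an assumed threshold (`cut_threshold`), and from each cluster the feature most correlated with the target is kept.

```python
# Hypothetical sketch of the two-step FAST idea (not the authors' implementation).
# Assumptions: |Pearson correlation| as the relevance measure, Kruskal's algorithm
# with a union-find for the MST, and threshold-based edge cutting to form clusters.
import numpy as np


def kruskal_mst(weights):
    """Return MST edges (i, j, w) of a complete graph given a symmetric weight matrix."""
    n = weights.shape[0]
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    edges = sorted((weights[i, j], i, j) for i in range(n) for j in range(i + 1, n))
    mst = []
    for w, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:              # adding this edge does not create a cycle
            parent[ri] = rj
            mst.append((i, j, w))
    return mst


def fast_like_selection(X, y, cut_threshold=0.7):
    """Step 1: cluster features via an MST; step 2: keep one representative per cluster."""
    n_features = X.shape[1]
    # Distance between features: 1 - |correlation|, so similar features are "close".
    dist = 1.0 - np.abs(np.corrcoef(X, rowvar=False))

    # Build the MST with Kruskal's algorithm, then cut "long" edges to form clusters.
    mst = kruskal_mst(dist)
    parent = list(range(n_features))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j, w in mst:
        if w <= cut_threshold:    # keep only edges joining sufficiently similar features
            parent[find(i)] = find(j)

    clusters = {}
    for f in range(n_features):
        clusters.setdefault(find(f), []).append(f)

    # From each cluster, pick the feature most correlated with the target classes.
    relevance = np.abs([np.corrcoef(X[:, f], y)[0, 1] for f in range(n_features)])
    return [max(members, key=lambda f: relevance[f]) for members in clusters.values()]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 10))
    X[:, 5] = X[:, 0] + 0.01 * rng.normal(size=200)   # a redundant copy of feature 0
    y = X[:, 0] + X[:, 3]
    print(fast_like_selection(X, y))                   # one representative per cluster
```

In this sketch the redundant copy (feature 5) lands in the same cluster as feature 0, so only one of the two survives, which mirrors the claim that features drawn from different clusters are relatively independent.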
