首页> 美国政府科技报告 >Data Mining Feature Subset Weighting and Selection Using Genetic Algorithms

【24h】

Data Mining Feature Subset Weighting and Selection Using Genetic Algorithms

机译：基于遗传算法的数据挖掘特征子集加权和选择

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a simple genetic algorithm (sGA), which is developed under Genetic Rule and Classifier Construction Environment (GRaCCE) to solve feature subset selection and weighting problem to have better classification accuracy on k-nearest neighborhood (KNN) algorithm. Our hypotheses are that weighting the features will affect the performance of the KNN algorithm and will cause better classification accuracy rate than that of binary classification. The weighted- SGA algorithm uses real-value chromosomes to find the weights for features and binary-sGA uses integer-value chromosomes to select the subset of features from original feature set. A Repair algorithm is developed for weighted-sGA algorithm to guarantee the feasibility of chromosomes. By feasibility we mean that the sum of values of each gene in a chromosome must be equal to 1. To calculate the fitness values for each chromosome in the population, we use K Nearest Neighbor Algorithm (KNN) as our fitness function. The Euclidean distance from one individual to other individuals is calculated on the d-dimensional feature space to classify an unknown instance. GRaCCE searches for good feature subsets and their associated weights. These feature weights are then multiplied with normalized feature values and these new values are used to calculate the distance between features.

著录项

作者

展开▼
作者单位

展开▼
年度 2002
页码 1-124
总页数 124
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Algorithms; Set theory; Accuracy; Theses; Weighting functions; Classification; Hypotheses;

机译：算法;集合论;准确性;论文;加权函数;分类;假设;

相似文献

外文文献
中文文献
专利

1. Efficient Genetic-Wrapper Algorithm Based Data Mining for Feature Subset Selection in a Power Quality Pattern Recognition Application [J] . Brahmadesam Krishna, Baskaran Kaliaperumal The international arab journal of information technology . 2011,第4期

机译：电能质量模式识别应用中基于遗传算法的数据挖掘特征子集选择
2. Classification Performance Improvement Using Random Subset Feature Selection Algorithm for Data Mining [J] . Lakshmipadmaja D, B. Vishnuvardhan Big Data Research . 2018,第1期

机译：用于数据挖掘的随机子集特征选择算法的分类性能改进
3. Mining High Dimensional Data Using Attribute Clustering-Based Feature Subset Selection Algorithm [J] . Vivek Ravindra Prasad Pandey, T.Venu, N.Subhash Chandra International Journal of Computer Trends and Technology . 2014,第2期

机译：使用基于属性聚类的特征子集选择算法挖掘高维数据
4. Weighting and Feature Selection on Gene-Expression Data by the Use of Genetic Algorithms [C] . Olga M. Perez, Manuel Hidalgo-Conde, Francisco J. Marin, 7th International Workshop-Conference on Artificial and Natural Neural Networks, IWANN 2003 Pt.2 Jun 3-6, 2003 Mao, Menorca, Spain . 2003

机译：利用遗传算法对基因表达数据进行加权和特征选择
5. Genetic Algorithms and Feature Subset Selection for Predicting Athletic Performance: Case of Professional Football. [D] . Cordes, Victor. 2016

机译：预测运动成绩的遗传算法和特征子集选择：以职业足球为例。
6. An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset [O] . Senthilkumar Devaraj, S. Paulraj 2015

机译：多维数据集分类的有效特征子集选择算法
7. Mining of High Dimensional Data using Efficient Feature Subset Selection Clustering Algorithm (WEKA) [O] . Lakshmi Sarika T, B Tarakeswara Rao, Ph. D, 2015

机译：使用高效特征子集选择聚类算法（WEKa）挖掘高维数据

Data Mining Feature Subset Weighting and Selection Using Genetic Algorithms

摘要

著录项

相似文献

相关主题

期刊订阅