A new graph feature selection approach

Abstract

Feature selection (FS) is an important pre-processing technique in machine learning and data mining. It aims to select a small subset of relevant and informative features from an original feature space that may contain many irrelevant, redundant, and noisy features. Feature selection usually leads to better performance, better interpretability, and lower computational cost. In the literature, FS methods are categorized into three main approaches: filters, wrappers, and embedded methods. In this paper we introduce a new feature selection method called graph feature selection (GFS). The main steps of GFS are as follows. First, we create a weighted graph in which each node corresponds to a feature, and the weight between two nodes is computed from a matrix of individual and pairwise scores obtained with a decision-tree classifier. Second, at each iteration we split the graph into two random partitions with the same number of nodes, then keep moving the worst node from one partition to the other until the global modularity converges. Third, from the final best partition we select the top-ranked features according to a newly proposed variable-importance criterion. The results of GFS are compared to three well-known feature selection algorithms on nine benchmark datasets. The proposed method shows its ability and effectiveness at identifying the most informative feature subset.
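The graph-partitioning step described in the abstract can be sketched as follows. This is a minimal illustration based only on the abstract: the input `pair_scores` (a hypothetical dict mapping feature pairs to decision-tree-derived weights), the modularity proxy, and the node-move rule are all assumptions, since the paper's exact formulas are not given here. In particular, the sketch greedily tries flipping every node and relaxes the equal-size constraint, whereas the paper moves the single worst node at each step.

```python
import random

def build_feature_graph(pair_scores):
    """Nodes are features; the weight of edge (i, j) stands in for the
    matrix of individual and pairwise decision-tree scores."""
    nodes = sorted({n for edge in pair_scores for n in edge})
    return nodes, dict(pair_scores)

def modularity(weights, side):
    """Toy proxy for the paper's global modularity: total intra-partition
    edge weight minus total inter-partition edge weight."""
    intra = inter = 0.0
    for (i, j), w in weights.items():
        if side[i] == side[j]:
            intra += w
        else:
            inter += w
    return intra - inter

def gfs_partition(pair_scores, seed=0):
    """Split the features into two equal-sized random partitions, then
    greedily move nodes across until the modularity proxy stops improving."""
    rng = random.Random(seed)
    nodes, weights = build_feature_graph(pair_scores)
    order = nodes[:]
    rng.shuffle(order)
    half = len(order) // 2
    side = {n: (0 if k < half else 1) for k, n in enumerate(order)}
    best = modularity(weights, side)
    improved = True
    while improved:
        improved = False
        for n in nodes:
            side[n] ^= 1                      # tentatively flip the node
            score = modularity(weights, side)
            if score > best:
                best, improved = score, True  # keep the improving move
            else:
                side[n] ^= 1                  # revert: no improvement
    return side, best
```

On exit, no single node move can improve the score, so the returned partition is a local optimum of the modularity proxy; the final feature-ranking step within the best partition is omitted, as the abstract does not specify the variable-importance criterion.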
