MLCR: A Fast Multi-label Feature Selection Method Based on K-means and L2-norm



Abstract

Feature selection is an essential step in data mining and machine learning: by eliminating redundant and irrelevant features, it increases classification accuracy and reduces computation time. This paper introduces MLCR, a fast filter-based feature selection algorithm for multi-label datasets that combines clustering-based ranking in the feature-label space with the L2-norm. The method uses a two-step strategy. In the first step, the k-means algorithm clusters the features according to their correlation with the labels, so that similar features are grouped into the same cluster; the features in each cluster are then sorted by L2-norm in descending order, and each feature is assigned a rank. In the second step, features with the same rank are sorted in the same way as in the previous step and added to the feature ranking vector. To verify the efficiency of MLCR, we compared its results with those of five well-known multi-label feature selection algorithms on various real-world multi-label datasets of different dimensions. The results demonstrate that the proposed method outperforms the other methods in both classification measures and run-time.
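The two-step strategy described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not say which correlation measure is used, how k is chosen, or how ties are handled, so the use of absolute Pearson correlation, the plain Lloyd's k-means with random initialization, and the names `pearson`, `kmeans`, and `mlcr_rank` are all assumptions made for this example.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx > 0 and sy > 0 else 0.0


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means (assumed variant); returns a cluster index per point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                  for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old center if a cluster empties out
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign


def mlcr_rank(X, Y, k=2, seed=0):
    """Return feature indices ordered from most to least important.

    X: list of samples, each a list of feature values.
    Y: list of samples, each a list of 0/1 label values.
    """
    n_features, n_labels = len(X[0]), len(Y[0])
    feat_cols = [[row[j] for row in X] for j in range(n_features)]
    label_cols = [[row[l] for row in Y] for l in range(n_labels)]

    # Feature-label space: one vector of |correlation| values per feature
    # (absolute Pearson correlation is an assumption of this sketch).
    profile = [[abs(pearson(f, l)) for l in label_cols] for f in feat_cols]
    norm = [math.sqrt(sum(v * v for v in p)) for p in profile]

    assign = kmeans(profile, k, seed=seed)

    # Step 1: within each cluster, sort by L2-norm (descending);
    # a feature's position inside its cluster is its rank.
    by_rank = {}
    for c in range(k):
        members = sorted((j for j in range(n_features) if assign[j] == c),
                         key=lambda j: -norm[j])
        for rank, j in enumerate(members):
            by_rank.setdefault(rank, []).append(j)

    # Step 2: features sharing a rank are sorted by L2-norm again and
    # appended to the final ranking vector, rank by rank.
    ranking = []
    for rank in sorted(by_rank):
        ranking.extend(sorted(by_rank[rank], key=lambda j: -norm[j]))
    return ranking
```

Calling `mlcr_rank(X, Y, k=2)` on a small feature matrix `X` and a binary label matrix `Y` returns a permutation of the feature indices. Under these assumptions the feature with the largest L2-norm always comes first, because it holds rank 0 in its own cluster and wins the second-step sort among the rank-0 features.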


Bibliographic details

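Source
International Computer Conference, Computer Society of Iran | 2020 | 1 v. | 7 pages
Conference location
Authors
Amin Hashemi; Mohammad Bagher Dowlatshahi
Author affiliations

Conference organizer
Original format: PDF
Language of text
Chinese Library Classification: Physics
Keywords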
data mining; feature selection; learning (artificial intelligence); pattern classification; pattern clustering;

Date indexed: 2022-08-20 20:21:36

Similar literature


1. Fast multi-label feature selection based on information-theoretic feature ranking [J]. Lee Jaesung, Kim Dae-Won. Pattern Recognition: The Journal of the Pattern Recognition Society. 2015, No. 9.
2. Kernel Penalized K-means: A feature selection method based on Kernel K-means [J]. Maldonado Sebastian, Carrizosa Emilio, Weber Richard. Information Sciences: An International Journal. 2015.
3. Intelligent product-gene acquisition method based on K-means clustering and mutual information-based feature selection algorithm [J]. Li Pan, Ren Yanzhao, Yan Yan. Artificial intelligence for engineering design, analysis and manufacturing. 2019, No. 4.
4. MLCR: A Fast Multi-label Feature Selection Method Based on K-means and L2-norm [C]. Amin Hashemi, Mohammad Bagher Dowlatshahi. International Computer Conference, Computer Society of Iran. 2020.
5. Statistical model-based methods for observation selection in wireless sensor networks and for feature selection in classification [D]. Qi, Qi. 2012.
6. A PSO-based multi-objective multi-label feature selection method in classification [O]. Yong Zhang, Dun-wei Gong, Xiao-yan Sun.
7. New Multi-Label Correlation-Based Feature Selection Methods for Multi-Label Classification and Application in Bioinformatics [O]. Jungjit Suwimol. 2016.
