首页> 外文会议>International Computer Conference, Computer Society of Iran >MLCR: A Fast Multi-label Feature Selection Method Based on K-means and L2-norm
【24h】

MLCR: A Fast Multi-label Feature Selection Method Based on K-means and L2-norm

机译:MLCR:基于K-means和L2-Norm的快速多标签特征选择方法

获取原文

摘要

Feature selection is an essential step in data mining and machine learning that increases classification accuracy and reduces the computational time by eliminating redundant and unrelated features. In this paper, a fast feature selection algorithm is introduced based on clustering ranking in feature-label space and L2-norm, called MLCR. This method is a filter-based method for multi-label datasets. We used a two-step strategy for this method. First, we used the k-means algorithm to cluster the features based on their correlation with labels. Then we sorted the features in each cluster based on L2-norm in descending order and finally set rank to each feature. This will allow similar features to be grouped into one cluster. In the second step, the features with the same rank are sorted like the previous step and added to the feature ranking vector. To verify the efficiency of MLCR, we have compared the obtained results of this method with five well-known multi-label feature selection algorithms based on various real-world multilabel datasets in different dimensions. The results demonstrate that our proposed method outperforms the other methods in the classification measures and run-time.
机译:特征选择是数据挖掘和机器学习的重要步骤,可以通过消除冗余和不相关的功能来提高分类精度并降低计算时间。在本文中,基于特征标签空间和L2-NOM的聚类排名来引入快速特征选择算法,称为MLCR。该方法是用于多标签数据集的基于滤波器的方法。我们使用了这种方法的两步策略。首先,我们使用K-Means算法基于与标签的相关性来聚类特征。然后我们根据L2-Norm在降序中对每个群集的功能进行排序,最后将等级设置为每个功能。这将允许将类似的功能分组为一个群集。在第二步中,具有相同等级的特征如前一步骤方式,并添加到特征排名向量。为了验证MLCR的效率,我们已经将此方法的获得结果与五个众所周知的多标签特征选择算法进行了基于不同维度的各种实际多标签数据集。结果表明,我们所提出的方法优于分类措施和运行时的其他方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号