首页> 外文期刊>Information Sciences: An International Journal >Multilabel feature selection using ML-ReliefF and neighborhood mutual information for multilabel neighborhood decision systems
【24h】

Multilabel feature selection using ML-ReliefF and neighborhood mutual information for multilabel neighborhood decision systems

机译:Multilabel特征选择使用ML-Creieff和邻域互联信息的Multilabel邻里决策系统

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Feature selection as an essential preprocessing step in multilabel classification has been widely researched. Due to the diversity and complexity of multilabel datasets, some feature selection methods are unstable and yield low predictive accuracy. To address these issues, this paper presents a novel multilabel feature selection method using multilabel ReliefF (ML-ReliefF) and neighborhood mutual information in multilabel neighborhood decision systems. First, to solve the problem of the few available randomly selected samples when searching the nearest samples in ReliefF, the coefficient of difference and the average distance among the nearest similar and heterogeneous samples are introduced to evaluate the differences among the samples, and then the average differences among the similar or heterogeneous samples are defined. Using the Jaccard correlation coefficient, a new formula for updating feature weights is presented. Second, the margin of the sample is studied to granulate all samples under each label, and the concept of the neighborhood is given. By combining algebra with information views, some neighborhood entropy-based uncertainty measures for multilabel classification are investigated, and new neighborhood mutual information is proposed. Furthermore, an optimization objective function is constructed to evaluate the candidate features in multilabel neighborhood decision systems, all the properties are discussed, and the relationships of these measures are established. Finally, an improved ML-ReliefF algorithm is designed for preliminarily eliminating unrelated features to decrease the computational complexity for multilabel classification, and a heuristic forward multilabel feature selection algorithm is developed to remove redundant features and improve classification performance. Experimental results conducted on thirteen multilabel datasets to verify the effectiveness of the proposed algorithms in multilabel neighborhood decision systems are compared with representative methods. (C) 2020 Elsevier Inc. All rights reserved.
机译:专业选择作为多拉拉巴德分类的基本预处理步骤已被广泛研究。由于多标签数据集的多样性和复杂性,一些特征选择方法是不稳定的并且产生低预测精度。为了解决这些问题,本文提出了一种新的多标签特征选择方法,使用Multilabel邻域决策系统中的Multilabel Relieff(ML-Creieff)和邻域互联信息。首先,为了解决在Relieff中搜索最近的样本时,在搜索最近的样本时,解决了少数可用的样本的问题,引入了最接近的相似和异构样本中的平均距离以评估样品之间的差异,然后是平均值定义了相似或异质样本之间的差异。使用Jaccard相关系数,呈现用于更新要素权重的新公式。其次,研究样品的裕度被研究以造粒在每个标签下的所有样品,并且给出邻域的概念。通过将代数与信息视图结合起来,研究了一些基于社区熵的多标签分类的不确定性措施,并提出了新的邻居互信息。此外,构造优化目标函数以评估多拉拉带邻域决策系统中的候选特征,讨论所有属性,并且建立了这些措施的关系。最后,设计了一种改进的ML-Creieff算法,用于预先消除不相关的特征,以降低多标签分类的计算复杂度,并且开发了一种启发式前向多标签特征选择算法以消除冗余特征并提高分类性能。在十三个多标签数据集中进行的实验结果,以验证多拉布尔邻里决策系统中所提出的算法的有效性与代表方法进行了比较。 (c)2020 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号