首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Online multi-label streaming feature selection based on neighborhood rough set
【24h】

Online multi-label streaming feature selection based on neighborhood rough set

机译:基于邻域粗糙集的在线多标签流特征选择

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Multi-label feature selection has grabbed intensive attention in many big data applications. However, traditional multi-label feature selection methods generally ignore a real-world scenario, i.e., the features constantly flow into the model one by one over time. To address this problem, we develop a novel online multi-label streaming feature selection method based on neighborhood rough set to select a feature subset which contains strongly relevant and non-redundant features. The main motivation is that data mining based on neighborhood rough set does not require any priori knowledge of the feature space structure. Moreover, neighborhood rough set deals with mixed data without breaking the neighborhood and order structure of data. In this paper, we first introduce the maximum-nearest-neighbor of instance to granulate all instances which can solve the problem of granularity selection in neighborhood rough set, and then generalize neighborhood rough set in single-label to fit multi-label learning. Meanwhile, an online multi-label streaming feature selection framework, which includes online importance selection and online redundancy update, is presented. Under this framework, we propose a criterion to select the important features relative to the currently selected features, and design a bound on pairwise correlations between features under label set to filter out redundant features. An empirical study using a series of benchmark datasets demonstrates that the proposed method outperforms other state-of-the-art multi-label feature selection methods. (C) 2018 Elsevier Ltd. All rights reserved.
机译:多标签功能选择在许多大数据应用中抓住了强化关注。然而,传统的多标签特征选择方法通常忽略真实世界的场景,即,功能随着时间的推移,功能不断流入模型。为了解决这个问题,我们开发了一种基于邻域粗糙集的新型在线多标签流特征选择方法,以选择包含强相关和非冗余功能的特征子集。主要动机是基于邻域粗糙集的数据挖掘不需要任何先验特征空间结构的知识。此外,邻域粗糙集处理混合数据而不破坏邻域和数据的订单结构。在本文中,我们首先介绍最大邻居的实例来造粒,可以解决邻域粗糙集中的粒度选择问题,然后在单个标签中概括邻域粗糙集合以适应多标签学习。同时,呈现了一个在线多标签流特征选择框架,包括在线重要选择和在线冗余更新。在此框架下,我们提出了一个标准,以选择相对于当前所选功能的重要特征,并在标签集下的功能之间的成对相关性上设计绑定以过滤冗余功能。使用一系列基准数据集进行实证研究表明,所提出的方法优于其他最先进的多标签特征选择方法。 (c)2018年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号