...
首页> 外文期刊>IEICE transactions on information and systems >READER: Robust Semi-Supervised Multi-Label Dimension Reduction
【24h】

READER: Robust Semi-Supervised Multi-Label Dimension Reduction

机译:读者:稳健的半监督多标签降维

获取原文

摘要

Multi-label classification is an appealing and challenging supervised learning problem, where multiple labels, rather than a single label, are associated with an unseen test instance. To remove possible noises in labels and features of high-dimensionality, multi-label dimension reduction has attracted more and more attentions in recent years. The existing methods usually suffer from several problems, such as ignoring label outliers and label correlations. In addition, most of them emphasize on conducting dimension reduction in an unsupervised or supervised way, therefore, unable to utilize the label information or a large amount of unlabeled data to improve the performance. In order to cope with these problems, we propose a novel method termed R obust sE mi-supervised multi-lA bel D imE nsion R eduction, shortly READER. From the viewpoint of empirical risk minimization, READER selects most discriminative features for all the labels in a semi-supervised way. Specifically, the ?_(2,1)-norm induced loss function and regularization term make READER robust to the outliers in the data points. READER finds a feature subspace so as to keep originally neighbor instances close and embeds labels into a low-dimensional latent space nonlinearly. To optimize the objective function, an efficient algorithm is developed with convergence property. Extensive empirical studies on real-world datasets demonstrate the superior performance of the proposed method.
机译:多标签分类是一个吸引人且具有挑战性的监督学习问题,其中多个标签而不是单个标签与一个看不见的测试实例相关联。为了消除标签中的可能的噪音和高尺寸特征,近年来,多标签尺寸减小已引起越来越多的关注。现有方法通常遭受若干问题,例如忽略标签离群值和标签相关性。另外,它们中的大多数都强调以无监督或有监督的方式进行尺寸减小,因此,无法利用标签信息或大量未标签数据来改善性能。为了解决这些问题,我们提出了一种新的方法,称为R b obust s E b mi监督多l A b bel D im im b e nsion R ,不久是READER。从最小化经验风险的角度来看,READER以半监督方式为所有标签选择最具区分性的特征。具体来说,?_(2,1)-范数引发的损失函数和正则项使READER对数据点中的异常值具有鲁棒性。 READER找到一个特征子空间,以使原始邻居实例保持接近,并将标签非线性地嵌入到低维潜在空间中。为了优化目标函数,开发了一种具有收敛性的高效算法。对现实世界数据集的大量经验研究证明了该方法的优越性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号