...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Noisy multi-label semi-supervised dimensionality reduction
【24h】

Noisy multi-label semi-supervised dimensionality reduction

机译:嘈杂的多标签半监督维度减少

获取原文
获取原文并翻译 | 示例

摘要

Noisy labeled data represent a rich source of information that often are easily accessible and cheap to obtain, but label noise might also have many negative consequences if not accounted for. How to fully utilize noisy labels has been studied extensively within the framework of standard supervised machine learning over a period of several decades. However, very little research has been conducted on solving the challenge posed by noisy labels in non-standard settings. This includes situations where only a fraction of the samples are labeled (semi-supervised) and each high-dimensional sample is associated with multiple labels. In this work, we present a novel semi-supervised and multi-label dimensionality reduction method that effectively utilizes information from both noisy multi-labels and unlabeled data. With the proposed Noisy multi-label semi-supervised dimensionality reduction (NMLSDR) method, the noisy multi-labels are denoised and unlabeled data are labeled simultaneously via a specially designed label propagation algorithm. NMLSDR then learns a projection matrix for reducing the dimensionality by maximizing the dependence between the enlarged and denoised multi-label space and the features in the projected space. Extensive experiments on synthetic data, benchmark datasets, as well as a real-world case study, demonstrate the effectiveness of the proposed algorithm and show that it outperforms state-of-the-art multi-label feature extraction algorithms. (C) 2019 Elsevier Ltd. All rights reserved.
机译:嘈杂的标记数据代表丰富的信息来源,通常可以轻松访问,但如果未占用,则标签噪声也可能具有许多负面后果。如何在几十年的标准监督机器学习框架内广泛地研究了噪音标签。然而,在解决非标准设置中的噪声标签所带来的挑战时,已经进行了很少的研究。这包括仅标记样本的一部分(半监督)和每个高维样本与多个标签相关的情况。在这项工作中,我们提出了一种新型半监督和多标签维度减少方法,有效利用来自嘈杂的多标签和未标记数据的信息。利用所提出的嘈杂多标签半监督维度减少(NMLSDR)方法,通过特殊设计的标签传播算法同时标记嘈杂的多标签和未标记的数据。然后,NMLSDR通过最大化扩大和去噪多标签空间与投影空间中的功能之间的依赖性来了解投影矩阵以降低维度。对合成数据,基准数据集以及真实世界的案例研究的广泛实验证明了所提出的算法的有效性,并表明它优于最先进的多标签特征提取算法。 (c)2019年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号