首页> 外文会议>Pacific Rim international conference on artificial intelligence >Matrix Factorization for Identifying Noisy Labels of Multi-label Instances
【24h】

Matrix Factorization for Identifying Noisy Labels of Multi-label Instances

机译:用于识别多标签实例的嘈杂标签的矩阵分解

获取原文

摘要

Current effort on multi-label learning generally assumes that the given labels are noise-free. However, obtaining noise-free labels is quite difficult and often impractical. In this paper, we study how to identify a subset of relevant labels from a set of candidate ones given as annotations to instances, and introduce a matrix factorization based method called MF-INL. It first decomposes the original instance-label association matrix into two low-rank matrices using nonnegative matrix factorization with feature-based and label-based constraints to retain the geometric structure of instances and label correlations. MF-INL then reconstructs the association matrix using the product of the decomposed matrices, and identifies associations with the lowest confidence as noisy associations. An empirical study on real-world multi-label datasets with injected noisy labels shows that MF-INL can identify noisy labels more accurately than other related solutions and is robust to input parameters. We empirically demonstrate that both feature-based and label-based constraints contribute to boosting the performance of MF-INL.
机译:当前在多标签学习上的努力通常假设给定标签是无噪声的。然而,获得无噪声的标签是非常困难的,并且通常是不切实际的。在本文中,我们研究如何从作为实例注释给出的一组候选标签中识别相关标签的子集,并介绍一种称为MF-INL的基于矩阵分解的方法。首先使用具有基于特征和基于标签的约束的非负矩阵分解将原始的实例-标签关联矩阵分解为两个低阶矩阵,以保留实例和标签相关性的几何结构。然后,MF-INL使用分解后的矩阵的乘积重建关联矩阵,并将具有最低置信度的关联标识为嘈杂的关联。对带有注入噪声标签的现实世界多标签数据集的实证研究表明,MF-INL可以比其他相关解决方案更准确地识别噪声标签,并且对输入参数具有鲁棒性。我们经验证明,基于特征的约束和基于标签的约束都有助于提高MF-INL的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号