Harnessing Side Information for Classification Under Label Noise

Wei Yang; Gong Chen; Chen Shuo; Liu Tongliang; Yang Jian; Tao Dacheng

首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Harnessing Side Information for Classification Under Label Noise

【24h】

Harnessing Side Information for Classification Under Label Noise

机译：利用标签噪声分类的侧面信息

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Practical data sets often contain the label noise caused by various human factors or measurement errors, which means that a fraction of training examples might be mistakenly labeled. Such noisy labels will mislead the classifier training and severely decrease the classification performance. Existing approaches to handle this problem are usually developed through various surrogate loss functions under the framework of empirical risk minimization. However, they are only suitable for binary classification and also require strong prior knowledge. Therefore, this article treats the example features as side information and formulates the noisy label removal problem as a matrix recovery problem. We denote our proposed method as "label noise handling via side information" (LNSI). Specifically, the observed label matrix is decomposed as the sum of two parts, in which the first part reveals the true labels and can be obtained by conducting a low-rank mapping on the side information; and the second part captures the incorrect labels and is modeled by a row-sparse matrix. The merits of such formulation lie in three aspects: 1) the strong recovery ability of this strategy has been sufficiently demonstrated by intensive theoretical works on side information; 2) multi-class situations can be directly handled with the aid of learned projection matrix; and 3) only very weak assumptions are required for model design, making LNSI applicable to a wide range of practical problems. Moreover, we theoretically derive the generalization bound of LNSI and show that the expected classification error of LNSI is upper bounded. The experimental results on a variety of data sets including UCI benchmark data sets and practical data sets confirm the superiority of LNSI to state-of-the-art approaches on label noise handling.

机译：实际数据集通常包含由各种人类因素或测量误差引起的标签噪声，这意味着可能错误地标记了一小部分训练示例。这种嘈杂的标签将误导分类器培训并严重降低分类性能。处理此问题的现有方法通常是通过经验风险最小化框架下的各种替代损失功能而开发的。但是，它们仅适用于二进制分类，并且还需要强大的先验知识。因此，本文将示例特征视为侧面信息，并将噪声标签删除问题交给矩阵恢复问题。我们表示我们所提出的方法作为“通过侧面信息的标签噪声处理”（LNSI）。具体地，观察到的标签矩阵被分解为两个部分的总和，其中第一部分揭示了真实标签，并且可以通过在侧面信息上进行低秩映射来获得;第二部分捕获不正确的标签，并由行稀疏矩阵建模。这种制剂的优点在三个方面：1）通过侧面信息的密集理论作品充分证明了这一战略的强烈回收能力; 2）可以借助于学习的投影矩阵直接处理多级情况; 3）模型设计只需要非常弱的假设，使LNSI适用于各种实际问题。此外，我们理论上导出了LNSI的泛化范围，并显示LNSI的预期分类误差是上限。在包括UCI基准数据集和实际数据集的各种数据集的实验结果证实了LNSI对标签噪声处理的最先进方法的优越性。

著录项

来源
《Neural Networks and Learning Systems, IEEE Transactions on 》 |2020年第9期| 3178-3192| 共15页
作者
Wei Yang; Gong Chen; Chen Shuo; Liu Tongliang; Yang Jian; Tao Dacheng;
展开▼
作者单位

Nanjing Univ Sci & Technol Sch Comp Sci & Engn Nanjing 210094 Peoples R China|Xidian Univ State Key Lab Integrated Serv Networks Xian Peoples R China;

Nanjing Univ Sci & Technol Sch Comp Sci & Engn Nanjing 210094 Peoples R China|Xidian Univ State Key Lab Integrated Serv Networks Xian Peoples R China;

Nanjing Univ Sci & Technol Sch Comp Sci & Engn PCA Lab Nanjing 210094 Peoples R China|Nanjing Univ Sci & Technol Sch Comp Sci & Engn Key Lab Intelligent Percept & Syst High Dimens In Minist Educ Nanjing 210094 Peoples R China|Nanjing Univ Sci & Technol Sch Comp Sci & Engn Jiangsu Key Lab Image & Video Understanding Socia Nanjing 210094 Peoples R China;

Univ Sydney Fac Engn UBTECH Sydney Artificial Intelligence Ctr Sch Comp Sci Darlington NSW 2008 Australia;

Nanjing Univ Sci & Technol Sch Comp Sci & Engn PCA Lab Nanjing 210094 Peoples R China|Nanjing Univ Sci & Technol Sch Comp Sci & Engn Key Lab Intelligent Percept & Syst High Dimens In Minist Educ Nanjing 210094 Peoples R China|Nanjing Univ Sci & Technol Sch Comp Sci & Engn Jiangsu Key Lab Image & Video Understanding Socia Nanjing 210094 Peoples R China;

Univ Sydney Fac Engn UBTECH Sydney Artificial Intelligence Ctr Sch Comp Sci Darlington NSW 2008 Australia;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Noise measurement; Matrix decomposition; Training; Computer science; Task analysis; Learning systems; Risk management; Classification; generalization bound; label noise; matrix recovery; side information;

机译：噪声测量;矩阵分解;培训;计算机科学;任务分析;学习系统;风险管理;分类;概括;标记噪声;矩阵恢复;矩阵恢复;矩阵恢复;矩阵恢复;矩阵恢复;矩阵恢复;方面信息;

相似文献

外文文献
中文文献
专利

1. A label-noise robust active learning sample collection method for multi-temporal urban land-cover classification and change analysis [J] . ISPRS Journal of Photogrammetry and Remote Sensing . 2020 ,第May期

机译：一种标签噪声鲁棒的主动学习样本采集方法，用于多时相城市土地覆盖分类和变化分析
2. Towards instance-dependent label noise-tolerant classification: a probabilistic approach [J] . Bootkrajang Jakramate, Chaijaruwanich Jeerayut Pattern Analysis and Applications . 2020 ,第1期

机译：走向依赖实例的标签耐噪分类：一种概率方法
3. Animal species classification using deep neural networks with noise labels [J] . Ecological informatics: an international journal on ecoinformatics and computational ecology . 2020 ,第期

机译：动物物种使用噪声标签的深神经网络进行分类
4. Learning Multi-Label Aerial Image Classification Under Label Noise: A Regularization Approach Using Word Embeddings [C] . Yuansheng Hua, Sylvain Lobry, Lichao Mou, International Geoscience and Remote Sensing Symposium . 2020

机译：在标签噪声下学习多标签的空中图像分类：使用Word Embeddings的正则化方法
5. Harnessing the flexibility of tubulin tyrosine ligase to site-specifically label C-terminus of alpha-tubulin. [D] . Banerjee, Abhijit. 2010

机译：利用微管蛋白酪氨酸连接酶的灵活性来位点特异性标记α-微管蛋白的C末端。
6. Effects of Label Noise on Deep Learning-Based Skin Cancer Classification [O] . Achim Hekler, Jakob N. Kather, Eva Krieghoff-Henning, 2020

机译：标签噪声对基于深度学习的皮肤癌分类的影响
7. Learning Multi-Label Aerial Image Classification Under Label Noise: A Regularization Approach Using Word Embeddings [O] . Yuansheng Hua, Sylvain Lobry, Lichao Mou, 2020

机译：在标签噪声下学习多标签的空中图像分类：使用Word Embeddings的正则化方法
8. Labels or Attributes. Rethinking the Neighbors for Collective Classification in Sparsely-Labeled Networks [R] . McDowell, L K, Aha, D W 2013

机译：标签或属性。重新划分稀疏标记网络中集体分类的邻居

Harnessing Side Information for Classification Under Label Noise

摘要

著录项

相似文献

相关主题

期刊订阅