Visual Recognition by Learning From Web Data via Weakly Supervised Domain Generalization

Li Niu; Wen Li; Dong Xu; Jianfei Cai

首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Visual Recognition by Learning From Web Data via Weakly Supervised Domain Generalization

【24h】

Visual Recognition by Learning From Web Data via Weakly Supervised Domain Generalization

机译：通过弱监督域泛化从Web数据中学习来进行视觉识别

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, a weakly supervised domain generalization (WSDG) method is proposed for real-world visual recognition tasks, in which we train classifiers by using Web data (e.g., Web images and Web videos) with noisy labels. In particular, two challenging problems need to be solved when learning robust classifiers, in which the first issue is to cope with the label noise of training Web data from the source domain, while the second issue is to enhance the generalization capability of learned classifiers to an arbitrary target domain. In order to handle the first problem, the training samples within each category are partitioned into clusters, where we use one bag to denote each cluster and instances to denote the samples in each cluster. Then, we identify a proportion of good training samples in each bag and train robust classifiers by using the good training samples, which leads to a multi-instance learning (MIL) problem. In order to handle the second problem, we assume that the training samples possibly form a set of hidden domains, with each hidden domain associated with a distinctive data distribution. Then, for each category and each hidden latent domain, we propose to learn one classifier by extending our MIL formulation, which leads to our WSDG approach. In the testing stage, our approach can obtain better generalization capability by effectively integrating multiple classifiers from different latent domains in each category. Moreover, our WSDG approach is further extended to utilize additional textual descriptions associated with Web data as privileged information (PI), although testing data do not have such PI. Extensive experiments on three benchmark data sets indicate that our newly proposed methods are effective for real-world visual recognition tasks by learning from Web data.

机译：本文针对现实世界中的视觉识别任务提出了一种弱监督域综合（WSDG）方法，其中我们通过使用带有噪声标签的Web数据（例如Web图像和Web视频）来训练分类器。特别是在学习鲁棒的分类器时，需要解决两个具有挑战性的问题，其中第一个问题是应对来自源域的训练Web数据的标签噪声，而第二个问题是增强学习的分类器的泛化能力，以解决这些问题。任意目标域。为了解决第一个问题，将每个类别中的训练样本分为几类，在这里我们使用一个袋子来表示每个类，并使用实例来表示每个类中的样本。然后，我们在每个袋子中确定一部分良好的训练样本，并通过使用良好的训练样本来训练鲁棒的分类器，这将导致多实例学习（MIL）问题。为了处理第二个问题，我们假设训练样本可能形成一组隐藏域，每个隐藏域都与独特的数据分布相关联。然后，对于每个类别和每个隐藏的潜在领域，我们建议通过扩展MIL公式来学习一个分类器，从而得出WSDG方法。在测试阶段，我们的方法可以通过有效地集成每个类别中不同潜域的多个分类器来获得更好的泛化能力。而且，我们的WSDG方法得到了进一步扩展，以利用与Web数据关联的其他文本描述作为特权信息（PI），尽管测试数据没有这样的PI。在三个基准数据集上进行的大量实验表明，通过从Web数据中学习，我们新提出的方法对于现实世界中的视觉识别任务是有效的。

著录项

来源
《Neural Networks and Learning Systems, IEEE Transactions on》 |2017年第9期|1985-1999|共15页
作者
Li Niu; Wen Li; Dong Xu; Jianfei Cai;
展开▼
作者单位

Interdisciplinary Graduate School, Nanyang Technological University, Singapore;

Computer Vision Laboratory, ETH Zürich, Zürich, Switzerland;

The University of Sydney, Sydney, NSW, Australia;

School of Computer Engineering, Nanyang Technological University, Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Training; Testing; Visualization; Robustness; Support vector machines; Videos; Training data;

机译：培训;测试;可视化;稳健性;支持向量机;视频;培训数据;

相似文献

外文文献
中文文献
专利

1. Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition [J] . Fan Zhu, Ling Shao International Journal of Computer Vision . 2014,第1a2期

机译：弱监督跨域字典学习的视觉识别
2. Weakly supervised scale-invariant learning of models for visual recognition [J] . Fergus R, Perona P, Zisserman A International Journal of Computer Vision . 2007,第3期

机译：用于视觉识别的模型的弱监督尺度不变学习
3. Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition [J] . R. Fergus, P. Perona, A. Zisserman International Journal of Computer Vision . 2007,第3期

机译：视觉识别模型的弱监督尺度不变学习
4. Visual recognition by learning from web data: A weakly supervised domain generalization approach [C] . Li Niu, Wen Li, Dong Xu IEEE Conference on Computer Vision and Pattern Recognition . 2015

机译：通过网络数据学习视觉识别：弱监督域泛化方法
5. Visual Learning with Weak Supervision [D] . Cicek, Bayram Safa. 2021

机译：弱势监督视觉学习
6. Robust Semi-Supervised Traffic Sign Recognition via Self-Training and Weakly-Supervised Learning [O] . Obed Tettey Nartey, Guowu Yang, Sarpong Kwadwo Asare, 2020

机译：通过自我训练和弱监督学习实现可靠的半监督交通标志识别
7. Attend in groups: a weakly-supervised deep learning framework for learning from web data [O] . Zhuang, Bohan, Liu, Lingqiao, Li, Yao, 2016

机译：参加小组讨论：一个弱监督的深度学习框架从网络数据中学习

Visual Recognition by Learning From Web Data via Weakly Supervised Domain Generalization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅