首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information
【24h】

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

机译:MIML-FCN +:通过具有特权信息的完全卷积网络进行多实例多标签学习

获取原文

摘要

Multi-instance multi-label (MIML) learning has many interesting applications in computer visions, including multi-object recognition and automatic image tagging. In these applications, additional information such as bounding-boxes, image captions and descriptions is often available during training phrase, which is referred as privileged information (PI). However, as existing works on learning using PI only consider instance-level PI (privileged instances), they fail to make use of bag-level PI (privileged bags) available in MIML learning. Therefore, in this paper, we propose a two-stream fully convolutional network, named MIML-FCN+, unified by a novel PI loss to solve the problem of MIML learning with privileged bags. Compared to the previous works on PI, the proposed MIML-FCN+ utilizes the readily available privileged bags, instead of hard-to-obtain privileged instances, making the system more general and practical in real world applications. As the proposed PI loss is convex and SGD-compatible and the framework itself is a fully convolutional network, MIML FCN+ can be easily integrated with state-of-the-art deep learning networks. Moreover, the flexibility of convolutional layers allows us to exploit structured correlations among instances to facilitate more effective training and testing. Experimental results on three benchmark datasets demonstrate the effectiveness of the proposed MIML-FCN+, outperforming state-of-the-art methods in the application of multi-object recognition.
机译:多实例多标签(MIML)学习在计算机视觉中具有许多有趣的应用程序,包括多对象识别和自动图像标记。在这些应用程序中,在训练短语期间通常可以使用附加信息,例如边界框,图像标题和描述,这被称为特权信息(PI)。但是,由于有关使用PI学习的现有工作仅考虑实例级PI(特权实例),因此他们无法利用MIML学习中可用的袋级PI(特权包)。因此,在本文中,我们提出了一个两流全卷积网络,称为MIML-FCN +,由一种新型的PI损失统一起来,以解决使用特权袋进行MIML学习的问题。与先前关于PI的工作相比,拟议的MIML-FCN +使用了现成的特权包,而不是难以获得的特权实例,从而使该系统在实际应用中更加通用和实用。由于拟议的PI损失是凸的且与SGD兼容,并且框架本身是完全卷积的网络,因此MIML FCN +可以轻松地与最新的深度学习网络集成。此外,卷积层的灵活性使我们能够利用实例之间的结构化相关性来促进更有效的训练和测试。在三个基准数据集上的实验结果证明了所提出的MIML-FCN +的有效性,其在多对象识别应用中的表现优于最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号