首页> 外文会议>IEEE International Conference on Computer Vision >Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection
【24h】

Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection

机译:面向计算型婴儿学习:对象检测的弱监督方法

获取原文

摘要

Intuitive observations show that a baby may inherently possess the capability of recognizing a new visual concept (e.g., chair, dog) by learning from only very few positive instances taught by parent(s) or others, and this recognition capability can be gradually further improved by exploring and/or interacting with the real instances in the physical world. Inspired by these observations, we propose a computational model for weakly-supervised object detection, based on prior knowledge modelling, exemplar learning and learning with video contexts. The prior knowledge is modeled with a pre-trained Convolutional Neural Network (CNN). When very few instances of a new concept are given, an initial concept detector is built by exemplar learning over the deep features the pre-trained CNN. The well-designed tracking solution is then used to discover more diverse instances from the massive online weakly labeled videos. Once a positive instance is detected/identified with high score in each video, more instances possibly from different view-angles and/or different distances are tracked and accumulated. Then the concept detector can be fine-tuned based on these new instances. This process can be repeated again and again till we obtain a very mature concept detector. Extensive experiments on Pascal VOC-07/10/12 object detection datasets [9] well demonstrate the effectiveness of our framework. It can beat the state-of-the-art full-training based performances by learning from very few samples for each object category, along with about 20,000 weakly labeled videos.
机译:直观的观察表明,通过从父母或他人教过的很少的积极实例中学习,婴儿可能固有地具有识别新视觉概念(例如,椅子,狗)的能力,并且这种识别能力可以逐渐得到进一步提高。通过探索和/或与物理世界中的真实实例进行交互。受这些观察的启发,我们基于先验知识建模,样例学习和视频上下文学习,提出了一种用于弱监督目标检测的计算模型。先验知识是使用预训练的卷积神经网络(CNN)进行建模的。当给出新概念的实例很少时,通过对预训练的CNN的深层特征进行示例学习来构建初始概念检测器。经过精心设计的跟踪解决方案然后用于从大量的在线弱标记视频中发现更多不同的实例。一旦在每个视频中以高分检测/识别出一个阳性实例,就可以跟踪和累积更多可能来自不同视角和/或不同距离的实例。然后,可以基于这些新实例对概念检测器进行微调。可以一次又一次地重复此过程,直到获得非常成熟的概念检测器。在Pascal VOC-07 / 10/12对象检测数据集上进行的大量实验[9]很好地证明了我们框架的有效性。通过为每个对象类别从很少的样本中学习大约20,000个弱标记的视频,它可以击败基于最新的完整训练的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号