首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory
【24h】

UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory

机译:UberNet:使用各种数据集和有限的内存为低,中和高水平视觉训练通用卷积神经网络

获取原文

摘要

In this work we train in an end-to-end manner a convolutional neural network (CNN) that jointly handles low-, mid-, and high-level vision tasks in a unified architecture. Such a network can act like a swiss knife for vision tasks, we call it an UberNet to indicate its overarching nature. The main contribution of this work consists in handling challenges that emerge when scaling up to many tasks. We introduce techniques that facilitate (i) training a deep architecture while relying on diverse training sets and (ii) training many (potentially unlimited) tasks with a limited memory budget. This allows us to train in an end-to-end manner a unified CNN architecture that jointly handles (a) boundary detection (b) normal estimation (c) saliency estimation (d) semantic segmentation (e) human part segmentation (f) semantic boundary detection, (g) region proposal generation and object detection. We obtain competitive performance while jointly addressing all tasks in 0.7 seconds on a GPU. Our system will be made publicly available.
机译:在这项工作中,我们以端到端的方式训练卷积神经网络(CNN),该网络在统一体系结构中共同处理低,中和高级视觉任务。这样的网络可以充当视觉任务的瑞士刀,我们称其为UberNet,以表明其总体性质。这项工作的主要贡献在于应对在扩展到许多任务时出现的挑战。我们介绍了一些技术,这些技术有助于(i)依靠各种培训集来训练深度的体系结构,以及(ii)在有限的内存预算下训练许多(可能是无限的)任务。这使我们能够以端到端的方式训练统一的CNN架构,该架构可以共同处理(a)边界检测(b)正常估计(c)显着性估计(d)语义分割(e)人身分割(f)语义边界检测,(g)区域建议生成和目标检测。我们获得了出色的性能,同时在GPU上以0.7秒的速度共同解决了所有任务。我们的系统将公开提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号