首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory
【24h】

UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory

机译:Ubernet:使用不同数据集和有限的存储器培训用于低,中和高级视觉的通用卷积神经网络

获取原文

摘要

In this work we train in an end-to-end manner a convolutional neural network (CNN) that jointly handles low-, mid-, and high-level vision tasks in a unified architecture. Such a network can act like a swiss knife for vision tasks, we call it an UberNet to indicate its overarching nature. The main contribution of this work consists in handling challenges that emerge when scaling up to many tasks. We introduce techniques that facilitate (i) training a deep architecture while relying on diverse training sets and (ii) training many (potentially unlimited) tasks with a limited memory budget. This allows us to train in an end-to-end manner a unified CNN architecture that jointly handles (a) boundary detection (b) normal estimation (c) saliency estimation (d) semantic segmentation (e) human part segmentation (f) semantic boundary detection, (g) region proposal generation and object detection. We obtain competitive performance while jointly addressing all tasks in 0.7 seconds on a GPU. Our system will be made publicly available.
机译:在这项工作中,我们以端到端的方式训练一个卷积神经网络(CNN),该网络(CNN)共同处理统一架构中的低级,中和高级视觉任务。这样的网络可以像瑞士刀一样用于愿景任务,我们称之为Ubernet以表明其总体性质。这项工作的主要贡献包括处理在扩大到许多任务时出现的挑战。我们介绍了促进(i)培训深层建筑的技术,同时依靠多样化的培训集和(ii)培训许多(潜在无限)的内存预算的任务。这使我们能够以端到端的方式训练一个统一的CNN架构,该统一的CNN架构共同处理(a)边界检测(b)正常估计(c)显着估计(d)语义分割(e)人体分段(f)语义边界检测,(g)区域提案生成和对象检测。我们获得竞争性能,同时在GPU上联合解决0.7秒的所有任务。我们的系统将公开可用。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号