【24h】

CRAFT Objects from Images

机译:从图像中制作对象

获取原文

摘要

Object detection is a fundamental problem in image understanding. One popular solution is the R-CNN framework [15] and its fast versions [14, 27]. They decompose the object detection problem into two cascaded easier tasks: 1) generating object proposals from images, 2) classifying proposals into various object categories. Despite that we are handling with two relatively easier tasks, they are not solved perfectly and there's still room for improvement. In this paper, we push the "divide and conquer" solution even further by dividing each task into two sub-tasks. We call the proposed method "CRAFT" (Cascade Regionproposal-network And FasT-rcnn), which tackles each task with a carefully designed network cascade. We show that the cascade structure helps in both tasks: in proposal generation, it provides more compact and better localized object proposals, in object classification, it reduces false positives (mainly between ambiguous categories) by capturing both inter-and intra-category variances. CRAFT achieves consistent and considerable improvement over the state-of the-art on object detection benchmarks like PASCAL VOC 07/12 and ILSVRC.
机译:目标检测是图像理解中的基本问题。一种流行的解决方案是R-CNN框架[15]及其快速版本[14,27]。他们将对象检测问题分解为两个更简单的级联任务:1)从图像生成对象建议,2)将建议分类为各种对象类别。尽管我们正在处理两个相对简单的任务,但它们并不能完美解决,还有改进的余地。在本文中,我们通过将每个任务划分为两个子任务来进一步推动“分而治之”的解决方案。我们将所提出的方法称为“ CRAFT”(级联区域提案网络和FasT-rcnn),该方法通过精心设计的网络级联来解决每个任务。我们显示出级联结构在这两个任务中都有帮助:在提案生成中,它提供了更紧凑,更好的本地化对象提案;在对象分类中,它通过捕获类别间和类别内差异减少了误报(主要是在模棱两可的类别之间)。与先进的对象检测基准(例如PASCAL VOC 07/12和ILSVRC)相比,CRAFT取得了一致且显着的进步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号