...
首页> 外文期刊>Journal of visual communication & image representation >Bounding box regression with balance for harmonious object detection
【24h】

Bounding box regression with balance for harmonious object detection

机译:具有平衡的边界框回归,用于和谐的物体检测

获取原文
获取原文并翻译 | 示例

摘要

Localization is an essential part of object detection, which is usually accomplished by bounding box regression guided by en-norm-based or IoU-based loss functions, where IoU is known for its scale-invariant characteristics. However, introducing the scale-invariance into regression loss in traditional IoU-based methods may result in a bias in favor of smaller boxes and cause redundancy and unstable oscillations. To make up for these shortages of IoU-based losses, we propose a Scale-Balanced Factor (SF) that stabilizes the regression process via a simple adaptive factor. Furthermore, to compensate for the imbalance of different types of losses caused by SF and other IoU-based loss functions, regression losses are always multiplied by a hyperparameter, which is purely empirical and is hard to find an optimum. To address this issue, a Multi-Task Reinforced Equilibrium (MRE) is proposed to dynamically tweak the learning rate of each task based on reinforcement learning. The MRE can guarantee more balanced parameters and maximize the benefit of SF or other improvement methods for IoU. By incorporating the proposed SF and MRE into the classic detectors (RetinaNet, YOLO, and Faster R-CNN, etc.), we have achieved significant performance gains on MS COCO (0.8 AP similar to 1.9 AP) and PASCAL VOC (0.6 AP similar to 2.2 AP).
机译:定位是目标检测的重要组成部分,通常通过基于范数或基于 IoU 的损失函数引导的边界框回归来实现,其中 IoU 以其尺度不变特性而闻名。然而,在传统的基于 IoU 的方法中,将尺度不变性引入回归损失可能会导致偏向于较小的盒子,并导致冗余和不稳定振荡。为了弥补这些基于IoU的损失的不足,我们提出了一种规模平衡因子(SF),它通过一个简单的自适应因子来稳定回归过程。此外,为了补偿由SF和其他基于IoU的损失函数引起的不同类型损失的不平衡,回归损失总是乘以一个超参数,这纯粹是经验性的,很难找到一个最优值。针对这一问题,该文提出一种多任务强化均衡(Multi-Task Reinforced Equilibrium,MRE)算法,在强化学习的基础上动态调整每个任务的学习率。MRE可以保证参数更加平衡,并最大限度地发挥SF或其他IoU改进方法的效益。通过将所提出的SF和MRE整合到经典检测器(RetinaNet、YOLO和Faster R-CNN等)中,我们在MS COCO(0.8 AP与1.9 AP相似)和PASCAL VOC(0.6 AP与2.2 AP相似)上实现了显著的性能提升。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号