首页> 外文期刊>Journal of visual communication & image representation >Accurate bounding-box regression with distance-IoU loss for visual tracking
【24h】

Accurate bounding-box regression with distance-IoU loss for visual tracking

机译:用于视觉跟踪的具有距离 IoU 损失的精确边界框回归

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Most existing trackers are based on using a classifier and multi-scale estimation to estimate the target state. Consequently, and as expected, trackers have become more stable while tracking accuracy has stagnated. While trackers adopt a maximum overlap method based on an intersection-over-union (IoU) loss to mitigate this problem, there are defects in the IoU loss itself, that make it impossible to continue to optimize the objective function when a given bounding box is completely contained within/without another bounding box; this makes it very challenging to accurately estimate the target state. Accordingly, in this paper, we address the above-mentioned problem by proposing a novel tracking method based on a distance-IoU (DIoU) loss, such that the proposed tracker consists of target estimation and target classification. The target estimation part is trained to predict the DIoU score between the target ground-truth bounding-box and the estimated bounding-box. The DIoU loss can maintain the advantage provided by the IoU loss while minimizing the distance between the center points of two bounding boxes, thereby making the target estimation more accurate. Moreover, we introduce a classification part that is trained online and optimized with a Conjugate-Gradient based strategy to guarantee real-time tracking speed. Comprehensive experimental results demonstrate that the proposed method achieves competitive tracking accuracy when compared to state-of-the-art trackers while with a real-time tracking speed.
机译:大多数现有的跟踪器都基于使用分类器和多尺度估计来估计目标状态。因此,正如预期的那样,跟踪器变得更加稳定,而跟踪精度却停滞不前。虽然跟踪器采用基于交并并集(IoU)损失的最大重叠方法来缓解这一问题,但IoU损失本身存在缺陷,当给定的边界框完全包含在另一个边界框内/没有另一个边界框时,无法继续优化目标函数;这使得准确估计目标状态变得非常具有挑战性。因此,针对上述问题,本文提出了一种基于距离-IoU(DIoU)损失的跟踪方法,使得所提出的跟踪器由目标估计和目标分类组成。对目标估计部分进行训练,以预测目标地面实况边界框和估计边界框之间的 DIoU 分数。DIoU 损失可以保持 IoU 损失提供的优势,同时最小化两个边界框中心点之间的距离,从而使目标估计更加准确。此外,我们还引入了一个在线训练的分类部分,并使用基于共轭梯度的策略进行优化,以保证实时跟踪速度。综合实验结果表明,与最先进的跟踪器相比,所提方法在具有实时跟踪速度的同时,具有较强的跟踪精度。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号