首页> 外文期刊>Journal of visual communication & image representation >Design of anchor boxes and data augmentation for transformer-based vehicle localization
【24h】

Design of anchor boxes and data augmentation for transformer-based vehicle localization

机译:基于变压器的车辆定位的锚盒设计和数据增强

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Vehicle localization is an important task in the signal processing field. In recent years, context exploration has been widely studied, especially the nonlocal dependencies in an image, using, for example, attention and transformer mechanisms. However, these approaches encounter difficulties in achieving accurate localization owing to ineffective design and use of queries. Motivated by the fact that spatial information is determined by decoder embeddings and details of reference boxes, we propose a method of explicitly and dynamically modeling anchor boxes in the query generation module. Moreover, we design a geometry-aware data augmentation approach to increase the diversity of the data by employing multiple augmentation methods on an image. Experiments conducted on public datasets show that our approach can improve the average precision by approximately 1.1.
机译:车辆定位是信号处理领域的一项重要任务。近年来,上下文探索已被广泛研究,尤其是图像中的非局部依赖关系,例如使用注意力和转换器机制。然而,由于查询的设计和使用效率低下,这些方法在实现准确本地化方面遇到了困难。基于空间信息由解码器嵌入和参考框细节决定的这一特点,提出了一种在查询生成模块中对锚框进行显式动态建模的方法。此外,我们设计了一种几何感知的数据增强方法,通过在图像上采用多种增强方法来增加数据的多样性。在公共数据集上进行的实验表明,我们的方法可以将平均精度提高约1.1%。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号