Home > Foreign Journals > Image and Vision Computing > CrossFusion net: Deep 3D object detection based on RGB images and point clouds in autonomous driving

CrossFusion net: Deep 3D object detection based on RGB images and point clouds in autonomous driving



Abstract

In recent years, accurate 3D detection has played an important role in many applications; autonomous driving is a typical example. This paper aims to design an accurate 3D detector that takes both LiDAR point clouds and RGB images as inputs, motivated by the fact that LiDAR and cameras have complementary merits. A novel deep end-to-end two-stream learnable architecture, CrossFusion Net, is designed to exploit features from both LiDAR point clouds and RGB images through a hierarchical fusion structure. Specifically, CrossFusion Net utilizes the bird's eye view (BEV) of point clouds obtained through projection. The feature maps of the two streams are fused through the newly introduced CrossFusion (CF) layer. The proposed CF layer transforms the feature maps of one stream into the other based on the spatial relationship between the BEV and RGB images. Additionally, we apply an attention mechanism to the transformed feature map and the original one to automatically decide the importance of the two feature maps from the two sensors. Experiments on the challenging KITTI car 3D detection and BEV detection benchmarks show that the presented approach outperforms other state-of-the-art methods in average precision (AP); in particular, it outperforms UberATG-ContFuse [3] by 8% AP on moderate 3D car detection. Furthermore, the proposed network learns an effective representation of the surrounding environment via RGB and BEV feature maps. (C) 2020 Elsevier B.V. All rights reserved.
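The two ingredients the abstract names, projecting a LiDAR point cloud into a BEV grid and attention-weighting a stream's own feature map against the map transformed from the other stream, can be illustrated with a minimal NumPy sketch. All parameters here (grid ranges, resolution, and the channel-mean attention score) are illustrative assumptions, not the paper's actual CF-layer design:

```python
import numpy as np

def bev_projection(points, x_range=(0, 70), y_range=(-40, 40), res=0.5):
    """Project LiDAR points (N, 3: x, y, z) onto a BEV height map.
    Each grid cell keeps the maximum point height falling into it."""
    h = int((x_range[1] - x_range[0]) / res)
    w = int((y_range[1] - y_range[0]) / res)
    bev = np.zeros((h, w), dtype=np.float32)
    xs = ((points[:, 0] - x_range[0]) / res).astype(int)
    ys = ((points[:, 1] - y_range[0]) / res).astype(int)
    valid = (xs >= 0) & (xs < h) & (ys >= 0) & (ys < w)
    for x, y, z in zip(xs[valid], ys[valid], points[valid, 2]):
        bev[x, y] = max(bev[x, y], z)
    return bev

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_fusion(own, transformed):
    """Attention-weighted blend of a stream's own feature map (H, W, C)
    with the map transformed from the other stream: a per-location
    softmax gate decides how much each sensor contributes."""
    # Hypothetical scoring: per-location channel means as importance logits.
    scores = np.stack([own.mean(-1), transformed.mean(-1)], axis=-1)  # (H, W, 2)
    gate = softmax(scores, axis=-1)
    return gate[..., :1] * own + gate[..., 1:] * transformed
```

Because the gate is a convex combination per spatial location, the fused map always lies between the two input maps element-wise, letting the network lean on whichever sensor is more informative at each location.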

