Convolutional Neural Networks (CNNs) are successfully used for various visual perception tasks, including bounding-box object detection, semantic segmentation, optical flow, depth estimation, and visual SLAM. Generally, these tasks are explored and modeled independently. In this paper, we present a joint multi-task network design for learning several such tasks simultaneously. The main advantages are increased run-time efficiency through network parameters shared across tasks, scalability to add more tasks by leveraging existing features, and better generalization through inductive transfer. We provide a systematic taxonomy of multi-task learning CNN topologies based on an extensive survey of architectures, loss functions, and training strategies. We classify Deep Multi-Task Learning (DMTL) topologies into five categories: parallel task branches, sequential task branches, soft parameter sharing, hierarchical representations, and recurrent topologies. The proposed network jointly learns object detection and semantic segmentation and is implemented in the Keras and TensorFlow frameworks. The architecture consists of a ResNet-10 encoder as a common trunk and two task-specific decoders: a YOLO-like decoder for object detection and an FCN8-like decoder for semantic segmentation. We demonstrate the prototype on wide-angle fisheye cameras, which are becoming popular for automated driving because of their large field of view (FOV). We believe this is the first work to demonstrate DMTL on surround-view fisheye cameras.
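To make the joint-training idea concrete, the sketch below shows how losses from the two task decoders sharing one trunk are typically combined into a single objective. The weighting scheme and values here are illustrative assumptions, not the paper's actual settings.

```python
# Minimal sketch of joint multi-task loss weighting for a shared-trunk
# network with a detection head and a segmentation head.
# The weights w_det / w_seg are hypothetical; the paper does not specify them here.

def joint_loss(det_loss: float, seg_loss: float,
               w_det: float = 1.0, w_seg: float = 1.0) -> float:
    """Weighted sum of per-task losses; gradients of this scalar flow back
    through both decoders into the shared ResNet-style trunk."""
    return w_det * det_loss + w_seg * seg_loss

# Example: equal weighting of the two tasks.
total = joint_loss(det_loss=2.0, seg_loss=3.0)  # -> 5.0 with unit weights
```

In practice the relative weights control how much each task shapes the shared features; tuning them (or learning them) is a common way to balance detection accuracy against segmentation quality.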