Detection of Big Animals on Images with Road Scenes using Deep Learning

机译：使用深度学习在道路场景图像上检测大动物

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The recognition of big animals on the images with road scenes has received little attention in modern research. There are very few specialized data sets for this task. Popular open data sets contain many images of big animals, but the most part of them is not correspond to road scenes that is necessary for on-board vision systems of unmanned vehicles. The paper describes the preparation of such a specialized data set based on Google Open Images and COCO datasets. The resulting data set contains about 20000 images of big animals of 10 classes: 'Bear', 'Fox', 'Dog', 'Horse', 'Goat', 'Sheep', 'Cow', 'Zebra', 'Elephant', 'Giraffe'. Deep learning approaches to detect these objects are researched in the paper. Authors trained and tested modern neural network architectures YOLOv3, RetinaNet R-50-FPN, Faster R-CNN R-50-FPN, Cascade R-CNN R-50-FPN. To compare the approaches the mean average precision (mAP) was determined at IoU≥50%, also their speed was calculated for input tensor sizes 640x384x3. The highest quality metrics are demonstrated by architecture YOLOv3 as for ten classes (0.78 mAP) and one joint class (0.92 mAP) detection with speed more 35 fps on NVidia Tesla V-100 32GB video card. At the same hardware, the RetinaNet R-50-FPN architecture provided recognition speed of more than 44 fps and a 13% lower mAP. The software implementation was done using the Keras and PyTorch deep learning libraries and NVidia CUDA technology. The proposed data set and neural network approach to recognizing big animals on images have shown their effectiveness and can be used in the on-board vision systems of driverless cars or in driver assistant systems.

机译：在具有道路场景的图像上对大型动物的识别在现代研究中很少受到关注。很少有专门的数据集可以完成此任务。流行的开放数据集包含许多大型动物的图像，但是其中大部分都不对应于无人驾驶汽车的车载视觉系统所必需的道路场景。本文介绍了基于Google Open Images和COCO数据集的此类专用数据集的准备。结果数据集包含约20000张10类大型动物的图像：“熊”，“狐狸”，“狗”，“马”，“山羊”，“绵羊”，“母牛”，“斑马”，“大象” ，“长颈鹿”。本文研究了检测这些对象的深度学习方法。作者培训并测试了现代神经网络体系结构YOLOv3，RetinaNet R-50-FPN，Faster R-CNN R-50-FPN，Cascade R-CNN R-50-FPN。为了比较这些方法，在IoU≥50％时确定了平均平均精度（mAP），并且还针对输入张量大小640x384x3计算了它们的速度。 YOLOv3体系结构在NVidia Tesla V-100 32GB视频卡上以十种等级（0.78 mAP）和一种联合等级（0.92 mAP）的检测速度达到了35 fps以上，证明了最高的质量指标。在相同的硬件上，RetinaNet R-50-FPN架构提供了超过44 fps的识别速度，并且mAP降低了13％。该软件的实现是使用Keras和PyTorch深度学习库以及NVidia CUDA技术完成的。所提出的用于在图像上识别大动物的数据集和神经网络方法已显示出它们的有效性，可用于无人驾驶汽车的车载视觉系统或驾驶员辅助系统。

著录项

来源
《International Conference on Artificial Intelligence: Applications and Innovations》|2019年|100-1003|共904页
会议地点 Belgrade(RS)
作者
Dmitry Yudin; Anton Sotnikov; Andrey Krishtopik;
展开▼
作者单位

Moscow Institute of Physics and Technology (National Research University);

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Animals; Neural networks; Roads; Feature extraction; Measurement; Training; Machine learning;

机译：动物;神经网络;道路；特征提取;测量;训练;机器学习;

相似文献

外文文献
中文文献
专利

1. Deep learning for detection of text polarity in natural scene images [J] . Perepu Pavan Kumar Neurocomputing . 2021,第Mara28期

机译：在自然场景图像中检测文本极性的深度学习
2. RDD2020: An annotated image dataset for automatic road damage detection using deep learning [J] . Deeksha Arya, Hiroya Maeda, Sanjay Kumar Ghosh, Data in Brief . 2021,第a期

机译：RDD2020：使用深度学习的用于自动道路损坏检测的注释图像数据集
3. OBJECT DETECTION FROM MMS IMAGERY USING DEEP LEARNING FOR GENERATION OF ROAD ORTHOPHOTOS [J] . Li Y., Sakamoto M., Shinohara T., International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences . 2018,第4期

机译：使用深度学习生成道路矫正器从MMS影像中进行对象检测
4. Detection of Big Animals on Images with Road Scenes using Deep Learning [C] . Dmitry Yudin, Anton Sotnikov, Andrey Krishtopik International Conference on Artificial Intelligence: Applications and Innovations . 2019

机译：利用深层学习检测道路场景的大型动物
5. Distress Detection of Road Images Using Deep Learning [D] . Goud, Rishabh Sanjay 2019

机译：使用深度学习对道路图像进行遇险检测
6. RDD2020: An annotated image dataset for automatic road damage detection using deep learning [O] . Deeksha Arya, Hiroya Maeda, Sanjay Kumar Ghosh, 2021

机译：RDD2020：用于使用深度学习的自动道路损坏检测的注释图像数据集
7. Failure Detection for Semantic Segmentation on Road Scenes Using Deep Learning [O] . Junho Song, Woojin Ahn, Sangkyoo Park, 2021

机译：利用深度学习对道路场景中的语义细分失败检测

Detection of Big Animals on Images with Road Scenes using Deep Learning

摘要

著录项

相似文献

相关主题

期刊订阅