首页> 外文会议>IEEE International Conference on Data Science in Cyberspace >Performance Comparison of Deep Learning Techniques for Recognizing Birds in Aerial Images
【24h】

Performance Comparison of Deep Learning Techniques for Recognizing Birds in Aerial Images

机译:深度学习技术在航空图像中识别鸟类的性能比较

获取原文

摘要

In computer vision, significant advances have been made in recent years on object recognition and detection with the rapid development of deep learning, especially deep convolutional neural networks (CNN). The majority of deep learning methods for object detection have been developed for large objects and their performances on small-object detection are not very good. This paper contributes to research in low-resolution small-object detection by evaluating the performances of leading deep learning methods for object detection using a common dataset, which is a new dataset for bird detection, called Little Birds in Aerial Imagery (LBAI), created from real-life aerial imagery data. LBAI contains birds with sizes ranging from 10px to 40px. In our experiments, some of the best deep learning architectures were implemented and applied to LBAI, which include object detection techniques such as YOLOv2, SSH, and Tiny Face, in addition to small instance segmentation techniques including U-Net and Mask R-CNN. Among the object detection methods, experimental results demonstrated that SSH performed the best for easy cases, whereas Tiny Face performed the best for hard cases, i.e. where a cluttered background makes detecting birds difficult. Among small instance segmentation methods, experimental results revealed U-Net achieved slightly better performance than Mask R-CNN.
机译:在计算机视觉方面,近年来,随着深度学习(尤其是深度卷积神经网络(CNN))的快速发展,在对象识别和检测方面取得了重大进展。大多数用于对象检测的深度学习方法都是针对大型对象开发的,它们在小对象检测方面的性能不是很好。本文通过使用通用数据集评估领先的深度学习方法进行对象检测,从而为低分辨率小对象检测研究做出了贡献,该通用数据集是一种新的鸟类检测数据集,称为航空影像中的小鸟(LBAI),已创建来自现实的航空影像数据。 LBAI包含大小在10像素到40像素之间的鸟类。在我们的实验中,一些最佳的深度学习架构已实现并应用于LBAI,其中包括对象检测技术(例如YOLOv2,SSH和Tiny Face),以及小实例分割技术(包括U-Net和Mask R-CNN)。在对象检测方法中,实验结果表明SSH在易遇到的情况下表现最佳,而Tiny Face在遇到困难的情况下表现最佳,即背景杂乱,很难检测到鸟类。在小实例分割方法中,实验结果表明,U-Net的性能比Mask R-CNN略好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号