Computational Intelligence and Neuroscience

Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models



Abstract

Human detection in videos plays an important role in various real-life applications. Most traditional approaches depend on handcrafted features, which are problem-dependent and optimal only for specific tasks. Moreover, they are highly susceptible to dynamic events such as illumination changes, camera jitter, and variations in object size. Feature learning approaches, on the other hand, are cheaper and easier because highly abstract and discriminative features can be produced automatically without the need for expert knowledge. In this paper, we utilize automatic feature learning methods that combine optical flow with three different deep models (a supervised convolutional neural network (S-CNN), a pretrained CNN feature extractor, and a hierarchical extreme learning machine (H-ELM)) for human detection in videos captured by a nonstatic camera on an aerial platform at varying altitudes. The models are trained and tested on the publicly available and highly challenging UCF-ARG aerial dataset, and they are compared in terms of training accuracy, testing accuracy, and learning speed. The performance evaluation considers five human actions (digging, waving, throwing, walking, and running). Experimental results demonstrate that the proposed methods succeed at the human detection task. The pretrained CNN achieves an average accuracy of 98.09%. The S-CNN achieves an average accuracy of 95.6% with softmax and 91.7% with a Support Vector Machine (SVM) classifier. The H-ELM achieves an average accuracy of 95.9%. On an ordinary Central Processing Unit (CPU), training the H-ELM takes 445 seconds; training the S-CNN takes 770 seconds on a high-performance Graphics Processing Unit (GPU).
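The abstract's timing comparison (H-ELM training on a CPU versus S-CNN training on a GPU) follows from how extreme learning machines are trained: the hidden weights are random and fixed, and only the output weights are solved in closed form. The sketch below is not the paper's H-ELM; it is a minimal single-hidden-layer ELM in plain numpy, run on synthetic two-class data standing in for human/non-human feature vectors, purely to illustrate why this training step is so cheap.

```python
import numpy as np

def train_elm(X, y, n_hidden=64, seed=0):
    """Train a single-hidden-layer extreme learning machine.

    Hidden-layer weights are drawn at random and never updated;
    only the output weights (beta) are fit, via linear least squares.
    """
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(X.shape[1], n_hidden))   # random input->hidden weights
    b = rng.normal(size=n_hidden)                 # random hidden biases
    H = np.tanh(X @ W + b)                        # fixed random feature map
    T = np.eye(int(y.max()) + 1)[y]               # one-hot class targets
    beta, *_ = np.linalg.lstsq(H, T, rcond=None)  # closed-form output weights
    return W, b, beta

def predict_elm(X, W, b, beta):
    H = np.tanh(X @ W + b)
    return np.argmax(H @ beta, axis=1)

# Synthetic, well-separated two-class data (illustrative only).
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(loc=-2.0, size=(100, 5)),
               rng.normal(loc=+2.0, size=(100, 5))])
y = np.array([0] * 100 + [1] * 100)

W, b, beta = train_elm(X, y)
acc = (predict_elm(X, W, b, beta) == y).mean()
```

Because the only fitting step is one least-squares solve, training cost is dominated by a single matrix factorization, which is why an ELM-style model can be trained in minutes on an ordinary CPU while a CNN trained by backpropagation needs many gradient passes even on a GPU.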
