首页> 外文会议>International Conference on Web Research >Human Action Recognition in Video Using DB-LSTM and ResNet
【24h】

Human Action Recognition in Video Using DB-LSTM and ResNet

机译:使用DB-LSTM和RESET的视频中的人类行动识别

获取原文

摘要

Human action recognition in video is one of the most widely applied topics in the field of image and video processing, with many applications in surveillance (security, sports, etc.), activity detection, video-content-based monitoring, man-machine interaction, and health/disability care. Action recognition is a complex process that faces several challenges such as occlusion, camera movement, viewpoint move, background clutter, and brightness variation. In this study, we propose a novel human action recognition method using convolutional neural networks (CNN) and deep bidirectional LSTM (DB-LSTM) networks, using only raw video frames. First, deep features are extracted from video frames using a pre-trained CNN architecture called ResNet152. The sequential information of the frames is then learned using the DB-LSTM network, where multiple layers are stacked together in both forward and backward passes of DB-LSTM, to increase depth. The evaluation results of the proposed method using PyTorch, compared to the state-of-the-art methods, show a considerable increase in the efficiency of action recognition on the UCF 101 dataset, reaching 95% recognition accuracy. The choice of the CNN architecture, proper tuning of input parameters, and techniques such as data augmentation contribute to the accuracy boost in this study.
机译:视频中的人类行动识别是图像和视频处理领域最广泛应用的主题之一,在监控(安全,体育等),活动检测,基于视频内容的监控,人机交互中的许多应用程序和健康/残疾护理。行动识别是一个复杂的过程,面临诸如遮挡,相机运动,观点移动,背景杂波和亮度变化的几个挑战。在这项研究中,我们仅使用Raw视频帧使用卷积神经网络(CNN)和深双向LSTM(DB-LSTM)网络的新型人体行动识别方法。首先,使用称为Resnet152的预先训练的CNN架构从视频帧中提取深度特征。然后使用DB-LSTM网络学习帧的顺序信息,其中多个层在DB-LSTM的前向和后向后堆叠在一起,以增加深度。与最先进的方法相比,使用Pytorch的所提出方法的评估结果显示了UCF 101数据集上的动作识别效率的相当大,达到了95%的识别精度。 CNN架构的选择,正确调整输入参数以及数据增强等技术有助于本研究中的准确性提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号