首页> 外文期刊>Journal of visual communication & image representation >Learning spatiotemporal representations for human fall detection in surveillance video
【24h】

Learning spatiotemporal representations for human fall detection in surveillance video

机译:在监控视频中学习人类跌倒检测的时空表示

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, a computer vision based framework is proposed that detects falls from surveillance videos. Firstly, we employ background subtraction and rank pooling to model spatial and temporal representations in videos, respectively. We then introduce a novel three-stream Convolutional Neural Networks as an event classifier. Silhouettes and their motion history images serve as input to the first two streams, while dynamic images whose temporal duration is equal to motion history images, are used as input to the third stream. Finally, we apply voting on the results of event classification to perform multi camera fall detection. The main novelty of our method against the conventional ones is that high quality spatiotemporal representations in different levels are learned to take full advantage of the appearance and motion information. Extensive experiments have been conducted on two widely used fall data sets. The results have shown to demonstrate the effectiveness of the proposed method. (C) 2019 Elsevier Inc. All rights reserved.
机译:在本文中,提出了一种基于计算机视觉的框架,检测来自监视视频的落下。首先,我们使用背景减法和等级汇集来分别在视频中模拟空间和时间表示。然后,我们将新的三流卷积神经网络介绍为事件分类器。剪影及其运动历史图像用作前两个流的输入,而当时间持续时间等于运动历史图像的动态图像被用作第三流的输入。最后,我们在事件分类结果上申请表决,以执行多相机坠落检测。我们对传统方法的方法的主要新颖之处在于学习不同级别的高质量时空表示,以充分利用外观和运动信息。已经在两个广泛使用的秋季数据集中进行了广泛的实验。结果表明,展示了该方法的有效性。 (c)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号