首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos
【24h】

A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos

机译:基于建议的未修剪视频时空动作检测解决方案

获取原文

摘要

Existing approaches for spatio-temporal action detection in videos are limited by the spatial extent and temporal duration of the actions. In this paper, we present a modular system for spatio-temporal action detection in untrimmed surveillance videos. We propose a two stage approach. The first stage generates dense spatio-temporal proposals using hierarchical clustering and temporal jittering techniques on frame-wise object detections. The second stage is a Temporal Refinement I3D (TRI-3D) network that performs action classification and temporal refinement on the generated proposals. The object detection-based proposal generation step helps in detecting actions occurring in a small spatial region of a video frame, while temporal jittering and refinement helps in detecting actions of variable lengths. Experimental results on an unconstrained surveillance action detection dataset - DIVA - show the effectiveness of our system. For comparison, the performance of our system is also evaluated on the THUMOS'14 temporal action detection dataset.
机译:视频中时空动作检测的现有方法受到动作的空间范围和时间持续时间的限制。在本文中,我们提出了一种用于在未修剪的监控视频中进行时空动作检测的模块化系统。我们提出了一种两阶段的方法。第一阶段使用逐帧对象检测中的分层聚类和时间抖动技术生成密集的时空建议。第二阶段是时间细化I3D(TRI-3D)网络,该网络对生成的建议进行操作分类和时间细化。基于对象检测的建议生成步骤有助于检测在视频帧的较小空间区域中发生的动作,而时间抖动和优化则有助于检测可变长度的动作。在不受约束的监视动作检测数据集DIVA上的实验结果表明了我们系统的有效性。为了进行比较,我们还在THUMOS'14的临时动作检测数据集上评估了我们系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号