首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Action Snippets: How many frames does human action recognition require?
【24h】

Action Snippets: How many frames does human action recognition require?

机译:动作片段:人类行动认可需要多少帧?

获取原文

摘要

Visual recognition of human actions in video clips has been an active field of research in recent years. However, most published methods either analyse an entire video and assign it a single action label, or use relatively large look-ahead to classify each frame. Contrary to these strategies, human vision proves that simple actions can be recognised almost instantaneously. In this paper, we present a system for action recognition from very short sequences ("snippets") of 1-10 frames, and systematically evaluate it on standard data sets. It turns out that even local shape and optic flow for a single frame are enough to achieve ≈ 90% correct recognitions, and snippets of 5-7 frames (0.3-0.5 seconds of video) are enough to achieve a performance similar to the one obtainable with the entire video sequence.
机译:近年来,视频剪辑中的人类行为的视觉认识到了一个积极的研究领域。但是,大多数已发布的方法分析整个视频并将其分配单个动作标签,或者使用相对大的瞻,以对每个帧进行分类。与这些策略相反,人类的愿景证明可以几乎瞬间识别简单的行动。在本文中,我们提出了一个动作识别系统,从非常短的序列(“片段”)为1-10帧,并系统地在标准数据集上评估它。事实证明,单个帧的局部形状和光学流量甚至足以实现≈90%正确识别,而5-7帧的片段(0.3-0.5秒)足以实现类似于可获得的识别性能使用整个视频序列。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号