首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Progressively Parsing Interactional Objects for Fine Grained Action Detection
【24h】

Progressively Parsing Interactional Objects for Fine Grained Action Detection

机译:逐步解析交互对象以进行细粒度的动作检测

获取原文

摘要

Fine grained video action analysis often requires reliable detection and tracking of various interacting objects and human body parts, denoted as Interactional Object Parsing. However, most of the previous methods based on either independent or joint object detection might suffer from high model complexity and challenging image content, e.g., illumination/pose/appearance/scale variation, motion, and occlusion etc. In this work, we propose an end-to-end system based on recurrent neural network to perform frame by frame interactional object parsing, which can alleviate the difficulty through an incremental/progressive manner. Our key innovation is that: instead of jointly outputting all object detections at once, for each frame we use a set of long-short term memory (LSTM) nodes to incrementally refine the detections. After passing through each LSTM node, more object detections are consolidated and thus more contextual information could be utilized to localize more difficult objects. The object parsing results are further utilized to form object specific action representation for fine grained action detection. Extensive experiments on two benchmark fine grained activity datasets demonstrate that our proposed algorithm achieves better interacting object detection performance, which in turn boosts the action recognition performance over the state-of-the-art.
机译:细粒度的视频动作分析通常需要可靠地检测和跟踪各种交互对象和人体部位,称为“交互对象解析”。然而,基于独立或联合对象检测的大多数先前方法可能会遭受较高的模型复杂性和具有挑战性的图像内容的困扰,例如照明/姿势/外观/比例变化,运动和遮挡等。在这项工作中,我们提出了一种基于递归神经网络的端到端系统进行逐帧交互对象解析,可以通过增量/渐进的方式减轻难度。我们的主要创新之处在于:而不是一次联合输出所有对象检测,对于每一帧,我们使用一组长期短期记忆(LSTM)节点来逐步完善检测。通过每个LSTM节点后,将合并更多的对象检测,因此可以利用更多的上下文信息来定位更困难的对象。对象解析结果还用于形成对象特定的动作表示,以进行细粒度的动作检测。在两个基准细粒度活动数据集上进行的大量实验表明,我们提出的算法可实现更好的交互对象检测性能,从而反过来提高了动作识别性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号