首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Temporal Action Detection Using a Statistical Language Model
【24h】

Temporal Action Detection Using a Statistical Language Model

机译:使用统计语言模型的时间动作检测

获取原文

摘要

While current approaches to action recognition on presegmented video clips already achieve high accuracies, temporal action detection is still far from comparably good results. Automatically locating and classifying the relevant action segments in videos of varying lengths proves to be a challenging task. We propose a novel method for temporal action detection including statistical length and language modeling to represent temporal and contextual structure. Our approach aims at globally optimizing the joint probability of three components, a length and language model and a discriminative action model, without making intermediate decisions. The problem of finding the most likely action sequence and the corresponding segment boundaries in an exponentially large search space is addressed by dynamic programming. We provide an extensive evaluation of each model component on Thumos 14, a large action detection dataset, and report state-of-the-art results on three datasets.
机译:尽管目前对预先分割的视频片段进行动作识别的方法已经实现了很高的准确性,但是时间动作检测仍然远未达到可比的良好结果。在不同长度的视频中自动定位和分类相关动作片段被证明是一项艰巨的任务。我们提出了一种新的时间动作检测方法,包括统计长度和语言模型来表示时间和上下文结构。我们的方法旨在全局优化三个组成部分的联合概率,即长度和语言模型以及区分性行动模型,而无需做出中间决策。通过动态编程解决了在指数大的搜索空间中找到最可能的动作序列和相应的段边界的问题。我们在大型动作检测数据集Thumos 14上对每个模型组件提供了广泛的评估,并在三个数据集上报告了最新的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号