...
首页> 外文期刊>International journal of machine learning and cybernetics >A two-stage temporal proposal network for precise action localization in untrimmed video
【24h】

A two-stage temporal proposal network for precise action localization in untrimmed video

机译:一个两阶段时间建议网络,用于未经监测视频中的精确行动定位

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of long untrimmed videos. In the first stage, we propose a novel prior-minor watershed algorithm for action proposals with precise prior watershed proposal algorithm and minor supplementary sliding window algorithm. Here, we propose the correctness discriminator to fill the proposals that watershed proposal algorithm may omit with the sliding window proposals. In the second stage, an extended context pooling (ECP) is firstly proposed with two modules (internal and context). The context information module of ECP can structure the proposals and enhance the extended features of action proposals. Different level of ECP is introduced to model the action proposal region and make its extended context region more targeted and precise. Then, we propose a temporal context regression network, which adopts a multi-task loss to realize the training of the temporal coordinate regression and the action/background classification simultaneously, and outputs the precise temporal boundaries of the proposals. Here, we also propose prior-minor ranking to balance the effect of the prior watershed proposals and the minor supplementary proposals. On three large scale benchmarks THUMOS14, ActivityNet (v1.2 and v1.3), and Charades, our approach achieves superior performances compared with other state-of-the-art methods and runs over 1020 frames per second (fps) on a single NVIDIA Titan-X Pascal GPU, indicating that our method can efficiently improve the precision of action localization task.
机译:在本文中,我们提出了一种用于长虚拟视频的动作检测任务的两阶段时间提案算法。在第一阶段,我们提出了一种新的先前微小的流域算法,用于采用精确的先前流域提案算法和次要补充滑动窗口算法的行动提案。在这里,我们提出了正确的鉴别者来填补滑动窗口提案可以省略流域提案算法的提案。在第二阶段,首先提出了两个模块(内部和上下文)的扩展上下文池(ECP)。 ECP的上下文信息模块可以构建提案并增强行动提案的扩展功能。引入不同水平的ECP以模拟行动提案区域,并使其扩展的上下文区域更具针对性和精确。然后,我们提出了一个时间上下文回归网络,它采用多任务丢失来同时实现时间坐标回归和动作/背景分类的训练,并输出提案的精确时间边界。在这里,我们还提出了预先进行了次要的排名,以平衡前分水岭提案和未成年补充建议的效果。在三个大型基准测试Thumos14,ActivityNet(V1.2和V1.3)和Charades中,我们的方法与其他最先进的方法相比,实现了卓越的性能,并在单个中运行每秒超过1020帧(FPS) NVIDIA Titan-X Pascal GPU,表明我们的方法可以有效地提高行动本地化任务的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号