首页> 外文会议>IEEE International Conference on Image Processing >Rethinking Temporal Structure Modeling Method for Temporal Action Localization
【24h】

Rethinking Temporal Structure Modeling Method for Temporal Action Localization

机译:对时间行为定位的时间结构建模方法的重新思考

获取原文

摘要

Temporal action localization in untrimmed videos is an important but difficult task. Difficulties are encountered in the application of existing methods when modeling temporal structures of videos. In the present study, we developed a novel method, referred to as Gemini Network, for effective modeling of temporal structures and achieving high-performance temporal action localization. The significant improvements afforded by the proposed method are attributable to three major factors. First, the developed network utilizes two sub-nets for effective modeling of temporal structures. Second, three parallel feature extraction pipelines are used to prevent interference between the extractions of different stage features. Third, the proposed method utilizes auxiliary supervision, with the auxiliary classifier losses affording additional constraints for improving the modeling capability of the network. As a demonstration of its effectiveness, the Gemini Network was used to achieve state-of-the-art temporal action localization performance on two challenging datasets, namely, THUMOS14 and ActivityNet.
机译:未修剪视频中的时间动作本地化是一项重要但艰巨的任务。在对视频的时间结构进行建模时,在现有方法的应用中遇到了困难。在本研究中,我们开发了一种称为Gemini Network的新颖方法,用于对时间结构进行有效建模并实现高性能的时间动作定位。所提出的方法所带来的重大改进可归因于三个主要因素。首先,开发的网络利用两个子网对时间结构进行有效建模。其次,使用三个并行的特征提取管线来防止不同阶段特征的提取之间的干扰。第三,提出的方法利用辅助监督,辅助分类器损失为提高网络的建模能力提供了额外的约束。为了证明其有效性,双子网络被用于在两个具有挑战性的数据集THUMOS14和ActivityNet上实现最新的时间动作定位性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号