首页> 外文会议>Asian Conference on Computer Vision >Dynamic Temporal Pyramid Network: A Closer Look at Multi-scale Modeling for Activity Detection
【24h】

Dynamic Temporal Pyramid Network: A Closer Look at Multi-scale Modeling for Activity Detection

机译:动态时间金字塔网络:活动检测的多尺度建模的近距离观察

获取原文

摘要

Recognizing instances at varying scales simultaneously is a fundamental challenge in visual detection problems. While spatial multi-scale modeling has been well studied in object detection, how to effectively apply a multi-scale architecture to temporal models for activity detection is still under-explored. In this paper, we identify three unique challenges that need to be specifically handled for temporal activity detection. To address all these issues, we propose Dynamic Temporal Pyramid Network (DTPN), a new activity defection framework with a multi-scale pyramidal architecture featuring three novel designs: (1) We sample frame sequence dynamically with different frame per seconds (PPS) to construct a natural pyramidal representation for arbitrary-length input videos. (2) We design a two-branch multi-scale temporal feature hierarchy to deal with the inherent temporal scale variation of activity instances. (3) We further exploit the temporal context of activities by appropriately fusing multi-scale feature maps, and demonstrate that both local and global temporal contexts are important. By combining all these components into a uniform network, we end up with a single-shot activity detector involving single-pass inferencing and end-to-end training. Extensive experiments show that the proposed DTPN achieves state-of-the-art performance on the challenging ActvityNet dataset.
机译:同时识别不同比例的实例是视觉检测问题中的一项基本挑战。尽管在对象检测中已经对空间多尺度建模进行了很好的研究,但是如何有效地将多尺度体系结构应用于时间模型以进行活动检测仍处于探索中。在本文中,我们确定了三个暂时性的挑战,这些挑战需要专门针对时间活动检测进行处理。为了解决所有这些问题,我们提出了动态时间金字塔网络(DTPN),这是一种具有多尺度金字塔体系结构的新型活动缺陷框架,具有三种新颖的设计:(1)我们以每秒不同的帧数(PPS)动态采样帧序列,以达到以下目的:为任意长度的输入视频构建自然的金字塔表示。 (2)设计了两分支的多尺度时间特征层次结构,以处理活动实例的固有时间尺度变化。 (3)通过适当地融合多尺度特征图,我们进一步开发了活动的时间语境,并证明了本地和全局的时间语境都很重要。通过将所有这些组件组合到一个统一的网络中,我们最终得到了包含单次通过推理和端到端训练的单次活动检测器。大量实验表明,所提出的DTPN在具有挑战性的ActvityNet数据集上达到了最先进的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号