Temporal action detection is a challenging task that aims to detect various action instances in untrimmed videos. Existing detection approaches often fail to precisely localize the start and end times of action instances. To address this issue, we propose a novel Temporal Deconvolutional Pyramid Network (TDPN), in which a Temporal Deconvolution Fusion (TDF) module is developed at each pyramidal hierarchy to construct strong semantic features at multiple temporal scales for detecting action instances of various durations. In the TDF module, the temporal resolution of high-level features is expanded by a temporal deconvolution. The expanded high-level features are then fused with low-level features to form strong semantic features. The fused semantic features at multiple temporal scales are used to predict action categories and boundary offsets simultaneously, which significantly improves detection performance. In addition, a strict label assignment strategy is proposed for training to improve the precision of the temporal boundaries learned by the model. We evaluate our detection approach on two public datasets, THUMOS14 and MEXaction2. The experimental results demonstrate that our TDPN model achieves competitive performance on THUMOS14 and the best performance on MEXaction2 compared with other approaches.
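To make the TDF idea concrete, the following is a minimal NumPy sketch, not the paper's actual implementation: a 1-D temporal deconvolution (transposed convolution with kernel size 4, stride 2, padding 1, which exactly doubles the temporal length) upsamples a hypothetical high-level feature map, and the result is fused with a low-level feature map by element-wise addition. All shapes, the kernel configuration, and the additive fusion are illustrative assumptions.

```python
import numpy as np

def temporal_deconv(x, weight, stride=2, padding=1):
    """Transposed 1-D convolution over the temporal axis.

    x:      (C_in, T)          -- high-level feature map
    weight: (C_in, C_out, K)   -- deconvolution kernel
    With stride=2, K=4, padding=1 the output length is exactly 2*T.
    """
    c_in, t = x.shape
    _, c_out, k = weight.shape
    t_out = (t - 1) * stride + k
    y = np.zeros((c_out, t_out))
    for i in range(t):
        # Scatter each input time step into a K-wide window of the output
        # (the defining operation of a transposed convolution).
        y[:, i * stride : i * stride + k] += np.einsum('i,iok->ok', x[:, i], weight)
    # Crop the implicit padding from both ends.
    return y[:, padding : t_out - padding]

rng = np.random.default_rng(0)
high = rng.standard_normal((8, 16))        # high-level feature: 8 channels, 16 steps
low = rng.standard_normal((8, 32))         # low-level feature: 8 channels, 32 steps
w = rng.standard_normal((8, 8, 4)) * 0.1   # deconvolution weights (illustrative)

upsampled = temporal_deconv(high, w)       # temporal resolution doubled: (8, 32)
fused = upsampled + low                    # additive fusion (one possible strategy)
```

In a real network the fusion would typically be followed by a convolution and a nonlinearity, and the fused maps at every pyramid level would feed the classification and boundary-regression heads; the sketch only shows the resolution-matching step that makes the fusion possible.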