IEEE International Conference on Image Processing

Atrous Temporal Convolutional Network for Video Action Segmentation



Abstract

Fine-grained temporal human action segmentation in untrimmed videos is receiving increasing attention due to its extensive applications in surveillance, robotics, and beyond. It is crucial for an action segmentation system to be robust to the temporal scale of different actions, since in practical applications the duration of an action can vary from less than a second to tens of minutes. In this paper, we introduce a novel atrous temporal convolutional network (AT-Net), which explicitly generates multiscale video contextual representations via atrous temporal pyramid pooling (ATPP) and adopts an encoder-decoder fully convolutional architecture. In the decoding stage, AT-Net combines multiscale contextual features with low-level local features to generate high-quality action segmentation results. Experiments on the 50 Salads, GTEA, and JIGSAWS benchmarks demonstrate that AT-Net improves over the state of the art.
