European Conference on Computer Vision

End-to-End Joint Semantic Segmentation of Actors and Actions in Video



Abstract

Traditional video understanding tasks include human action recognition and actor/object semantic segmentation. However, the combined task of providing semantic segmentation for different actor classes simultaneously with their action class remains a challenging but necessary task for many applications. In this work, we propose a new end-to-end architecture for tackling this task in videos. Our model effectively leverages multiple input modalities, contextual information, and multitask learning in the video to directly output semantic segmentations in a single unified framework. We train and benchmark our model on the Actor-Action Dataset (A2D) for joint actor-action semantic segmentation, and demonstrate state-of-the-art performance for both segmentation and detection. We also perform experiments verifying that our approach improves performance for zero-shot recognition, indicating the generalizability of our jointly learned feature space.
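The abstract describes one network emitting both an actor label and an action label per pixel. A minimal NumPy sketch of that joint-prediction idea (shapes, weights, and function names are illustrative, not the paper's actual architecture): shared per-pixel features feed two separate linear heads, one per label space.

```python
import numpy as np

def joint_segmentation(features, w_actor, w_action):
    """Per-pixel joint actor/action prediction from shared features.

    features: (H, W, D) shared feature map
    w_actor:  (D, A) actor-head weights, A actor classes
    w_action: (D, C) action-head weights, C action classes
    Returns two (H, W) integer label maps.
    """
    actor_logits = features @ w_actor    # (H, W, A)
    action_logits = features @ w_action  # (H, W, C)
    return actor_logits.argmax(-1), action_logits.argmax(-1)

# Toy example: 2x2 spatial grid, 3-dim features, 4 actor / 5 action classes
rng = np.random.default_rng(0)
feats = rng.standard_normal((2, 2, 3))
actors, actions = joint_segmentation(
    feats, rng.standard_normal((3, 4)), rng.standard_normal((3, 5)))
print(actors.shape, actions.shape)  # (2, 2) (2, 2)
```

In the full model both heads are trained jointly against a shared backbone, which is what lets one feature space serve both label spaces.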
