International Conference on Neural Information Processing

Motion-Based Occlusion-Aware Pixel Graph Network for Video Object Segmentation



Abstract

This paper proposes a dual-channel Graph Convolutional Network (GCN) for the Video Object Segmentation (VOS) task. The main contribution lies in formulating two pixel graphs based on raw RGB and optical-flow features. Spatial and temporal features are learned independently, making the network robust to various challenging scenarios in real-world videos. Additionally, a motion-orientation-based aggregation scheme efficiently captures long-range dependencies among objects. This not only addresses the complex problem of modelling velocity differences among multiple objects moving in different directions, but also adapts to changes in object appearance caused by pose and scale deformations. An occlusion-aware attention mechanism further facilitates accurate segmentation when multiple objects exhibit temporal discontinuities in appearance due to occlusion. Performance analysis on the DAVIS-2016 and DAVIS-2017 datasets shows that the proposed method outperforms existing state-of-the-art techniques in foreground object segmentation. Control experiments on the CamVid dataset demonstrate the model's ability to generalise to scene segmentation.
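The dual-channel pixel-graph idea can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the 4-neighbourhood grid graph, the single standard GCN layer per branch (symmetrically normalised propagation, ReLU), and the concatenation-based fusion are all simplifying assumptions chosen to show how independent spatial (RGB) and temporal (optical-flow) branches over the same pixel graph can be combined.

```python
import numpy as np

def grid_adjacency(h, w):
    """Normalised adjacency D^-1/2 (A + I) D^-1/2 for an h*w pixel grid
    with 4-neighbour connectivity and self-loops (standard GCN propagation)."""
    n = h * w
    A = np.eye(n)
    for r in range(h):
        for c in range(w):
            i = r * w + c
            if c + 1 < w:                      # right neighbour
                A[i, i + 1] = A[i + 1, i] = 1.0
            if r + 1 < h:                      # bottom neighbour
                A[i, i + w] = A[i + w, i] = 1.0
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    return (A * d_inv_sqrt).T * d_inv_sqrt     # D^-1/2 A D^-1/2

def gcn_layer(A_hat, X, W):
    """One graph-convolution layer: ReLU(A_hat @ X @ W)."""
    return np.maximum(A_hat @ X @ W, 0.0)

# Toy 4x4 frame: 3 RGB channels and 2 optical-flow channels per pixel node.
rng = np.random.default_rng(0)
h = w = 4
X_rgb = rng.standard_normal((h * w, 3))
X_flow = rng.standard_normal((h * w, 2))
A_hat = grid_adjacency(h, w)

# Spatial (RGB) and temporal (flow) branches learned independently ...
H_rgb = gcn_layer(A_hat, X_rgb, rng.standard_normal((3, 8)))
H_flow = gcn_layer(A_hat, X_flow, rng.standard_normal((2, 8)))
# ... then fused per pixel before a segmentation head (fusion by
# concatenation is an assumption of this sketch).
H = np.concatenate([H_rgb, H_flow], axis=1)
print(H.shape)  # (16, 16): 16 pixel nodes, 16 fused features each
```

In the paper's full model the graph construction, the motion-orientation aggregator, and the occlusion-aware attention all go beyond this fixed-grid sketch; the sketch only captures the two-branch pixel-graph structure described in the abstract.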
