首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >Semantic Instance Meets Salient Object: Study on Video Semantic Salient Instance Segmentation
【24h】

Semantic Instance Meets Salient Object: Study on Video Semantic Salient Instance Segmentation

机译:语义实例符合突出对象:研究视频语义突出实例分割

获取原文

摘要

Focusing on only semantic instances that only salient in a scene gains more benefits for robot navigation and self-driving cars than looking at all objects in the whole scene. This paper pushes the envelope on salient regions in a video to decompose them into semantically meaningful components, namely, semantic salient instances. We provide the baseline for the new task of video semantic salient instance segmentation (VSSIS), that is, Semantic Instance - Salient Object (SISO) framework. The SISO framework is simple yet efficient, leveraging advantages of two different segmentation tasks, i.e. semantic instance segmentation and salient object segmentation to eventually fuse them for the final result. In SISO, we introduce a sequential fusion by looking at overlapping pixels between semantic instances and salient regions to have non-overlapping instances one by one. We also introduce a recurrent instance propagation to refine the shapes and semantic meanings of instances, and an identity tracking to maintain both the identity and the semantic meaning of instances over the entire video. Experimental results demonstrated the effectiveness of our SISO baseline, which can handle occlusions in videos. In addition, to tackle the task of VSSIS, we augment the DAVIS-2017 benchmark dataset by assigning semantic ground-truth for salient instance labels, obtaining SEmantic Salient Instance Video (SESIV) dataset. Our SESIV dataset consists of 84 high-quality video sequences with pixel-wisely per-frame ground-truth labels.
机译:只关注一个场景中只有突出的语义实例,对于机器人导航和自动驾驶汽车而言,只能在整个场景中的所有物体上看更多的效益。本文将突出区域的凸起推动到视频中以将它们分解为语义有意义的组件,即语义突出的情况。我们为视频语义突出实例分段(VSSI)的新任务提供了基准,即语义_simant - salient对象(Siso)框架。 SISO框架简单且有效,利用两个不同的分割任务的优势,即语义实例分割和突出对象分割,最终将它们熔断为最终结果。在SISO中,我们通过观察语义实例和突出区域之间的重叠像素来引入顺序融合,以逐一具有非重叠实例。我们还介绍了一种复制实例传播以改进实例的形状和语义含义,以及身份跟踪,以维护整个视频上的实例的身份和语义含义。实验结果表明了我们的SISO基线的有效性,可以处理视频中的闭塞。此外,为了解决vssis的任务,我们通过为SALICE实例标签分配语义地面真值来增强DAVIS-2017基准数据集,获取语义突出实例视频(SESIV)数据集。我们的Sesiv DataSet由84个高质量的视频序列组成,具有Pixel明智的每个帧地面真值标签。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号