
Label propagation in RGB-D video



Abstract

We propose a new method for propagating semantic labels in RGB-D video of indoor scenes given a set of ground-truth keyframes. Manually labeling all pixels in every frame of a video sequence is labor-intensive and costly, yet required for training and testing semantic segmentation methods. The availability of video enables propagation of labels between frames, yielding a large amount of annotated pixels. While previous methods commonly used optical-flow motion cues for label propagation, we present a novel approach that uses the camera poses and 3D point clouds to propagate the labels into superpixels computed on the unannotated frames of the sequence. The propagation task is formulated as an energy minimization problem in a Conditional Random Field (CRF). We performed experiments on 8 video sequences from the SUN3D dataset [1] and showed superior performance to an optical-flow-based label propagation approach. Furthermore, we demonstrated that the propagated labels can be used to learn better models with data-hungry deep convolutional neural network (DCNN) based approaches for the task of semantic segmentation. Performance increases when the ground-truth keyframes are combined with the propagated labels during training.
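The geometric propagation step the abstract describes (back-projecting keyframe pixels with depth, transferring them through the relative camera pose, and voting inside the target frame's superpixels) can be sketched roughly as below. This is a minimal illustration, not the authors' implementation: the pinhole intrinsics `K`, the camera-to-world pose convention, the precomputed superpixel map `tgt_superpixels`, and all function names are assumptions. The per-superpixel vote counts it returns would serve as the unary term of the CRF energy; the pairwise terms and the minimization itself are not shown.

```python
import numpy as np

def backproject(depth, K):
    """Back-project a depth map (H, W) into camera-space 3D points (H*W, 3)."""
    H, W = depth.shape
    u, v = np.meshgrid(np.arange(W), np.arange(H))  # pixel coordinates
    z = depth.ravel()
    x = (u.ravel() - K[0, 2]) * z / K[0, 0]
    y = (v.ravel() - K[1, 2]) * z / K[1, 1]
    return np.stack([x, y, z], axis=1)

def propagate_labels(key_depth, key_labels, pose_key, pose_tgt, K,
                     tgt_superpixels, n_classes):
    """Transfer keyframe labels into the target frame's superpixels.

    pose_key / pose_tgt: 4x4 camera-to-world matrices (assumed convention).
    Returns an (n_superpixels, n_classes) vote matrix usable as a CRF unary.
    """
    pts = backproject(key_depth, K)  # keyframe camera space
    pts_h = np.concatenate([pts, np.ones((len(pts), 1))], axis=1)
    # keyframe camera -> world -> target camera
    rel = np.linalg.inv(pose_tgt) @ pose_key
    pts_tgt = (rel @ pts_h.T).T[:, :3]

    # Project into the target image plane; keep points in front of the camera.
    valid = pts_tgt[:, 2] > 0
    u = np.round(K[0, 0] * pts_tgt[:, 0] / pts_tgt[:, 2] + K[0, 2]).astype(int)
    v = np.round(K[1, 1] * pts_tgt[:, 1] / pts_tgt[:, 2] + K[1, 2]).astype(int)

    H, W = tgt_superpixels.shape
    inside = valid & (u >= 0) & (u < W) & (v >= 0) & (v < H)
    labels = key_labels.ravel()[inside]
    sp_ids = tgt_superpixels[v[inside], u[inside]]

    # Accumulate label votes per superpixel; normalizing these counts gives
    # a per-superpixel class distribution for the CRF's unary potential.
    n_sp = int(tgt_superpixels.max()) + 1
    votes = np.zeros((n_sp, n_classes))
    np.add.at(votes, (sp_ids, labels), 1)
    return votes
```

A practical variant would also discard transferred points whose depth disagrees with the target frame's depth map (occlusion check), which the simple projection above does not handle.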
