IEEE/CVF Conference on Computer Vision and Pattern Recognition

Deep Spatio-Temporal Random Fields for Efficient Video Segmentation



Abstract

In this work we introduce a time- and memory-efficient method for structured prediction that couples neuron decisions across both space and time. We show that we are able to perform exact and efficient inference on a densely-connected spatio-temporal graph by capitalizing on recent advances in deep Gaussian Conditional Random Fields (GCRFs). Our method, called VideoGCRF, is (a) efficient, (b) has a unique global minimum, and (c) can be trained end-to-end alongside contemporary deep networks for video understanding. We experiment with multiple connectivity patterns in the temporal domain, and present empirical improvements over strong baselines on the tasks of both semantic and instance segmentation of videos. Our implementation is based on the Caffe2 framework and will be available at https://github.com/siddharthachandra/gcrf-v3.0.
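The "exact inference" and "unique global minimum" properties follow from the Gaussian form of the CRF: the energy is a convex quadratic in the output scores, so its minimizer is the solution of a linear system. The sketch below is an illustrative toy (not the paper's implementation): `A` stands for a symmetric pairwise coupling over spatio-temporal neighbors and `b` for the unary scores from a network; both names and the chain topology are assumptions made for the example.

```python
import numpy as np

def gcrf_inference(A, b, lam=1.0):
    """MAP inference for a Gaussian CRF with energy
    E(x) = 0.5 * x^T (A + lam*I) x - b^T x.
    For lam > 0 and symmetric A with non-negative diagonal dominance,
    (A + lam*I) is positive definite, so the minimizer is unique and
    found by solving one linear system."""
    n = A.shape[0]
    return np.linalg.solve(A + lam * np.eye(n), b)

# Toy example: 4 "pixels" coupled in a chain (stand-in for
# spatio-temporal neighbors across frames).
A = np.zeros((4, 4))
for i in range(3):
    A[i, i + 1] = A[i + 1, i] = -0.5   # attractive coupling between neighbors
    A[i, i] += 0.5                      # keep A diagonally dominant
    A[i + 1, i + 1] += 0.5

b = np.array([1.0, 0.0, 0.0, -1.0])    # hypothetical unary (network) scores
x = gcrf_inference(A, b)               # smoothed scores after coupling
```

In the full model this system is solved efficiently (e.g. by conjugate gradients) over a densely connected graph, and the solve is differentiable, which is what allows end-to-end training with the backbone network.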


