Dynamic Video Segmentation Network

机译：动态视频分割网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present a detailed design of dynamic video segmentation network (DVSNet) for fast and efficient semantic video segmentation. DVSNet consists of two convolutional neural networks: a segmentation network and a flow network. The former generates highly accurate semantic segmentations, but is deeper and slower. The latter is much faster than the former, but its output requires further processing to generate less accurate semantic segmentations. We explore the use of a decision network to adaptively assign different frame regions to different networks based on a metric called expected confidence score. Frame regions with a higher expected confidence score traverse the flow network. Frame regions with a lower expected confidence score have to pass through the segmentation network. We have extensively performed experiments on various configurations of DVSNet, and investigated a number of variants for the proposed decision network. The experimental results show that our DVSNet is able to achieve up to 70.4% mIoU at 19.8 fps on the Cityscape dataset. A high speed version of DVSNet is able to deliver an fps of 30.4 with 63.2% mIoU on the same dataset. DVSNet is also able to reduce up to 95% of the computational workloads.

机译：在本文中，我们提出了一种用于快速有效的语义视频分割的动态视频分割网络（DVSNet）的详细设计。 DVSNet由两个卷积神经网络组成：分段网络和流网络。前者生成高度准确的语义分段，但深度和速度较慢。后者比前者快得多，但是它的输出需要进一步处理以生成不太准确的语义分段。我们探索使用决策网络根据称为预期置信度得分的指标将不同的帧区域自适应地分配给不同的网络。具有较高预期置信度得分的框架区域遍历流动网络。预期置信度得分较低的框架区域必须通过分割网络。我们在DVSNet的各种配置上进行了广泛的实验，并研究了所建议决策网络的许多变体。实验结果表明，我们的DVSNet可以在Cityscape数据集上以19.8 fps的速度达到70.4％的mIoU。高速版本的DVSNet可以在同一数据集上以63.2％的mIoU提供30.4的fps。 DVSNet还能够减少多达95％的计算工作量。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition》|2018年|6556-6565|共10页
会议地点 Salt Lake City(US)
作者
Yu-Syuan Xu; Tsu-Jui Fu; Hsuan-Kung Yang; Chun-Yi Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Semantics; Image segmentation; Feature extraction; Computer architecture; Video sequences; Acceleration; Adaptive scheduling;

机译：语义学图像分割特征提取;计算机架构;视频序列；加速；自适应调度;
入库时间 2022-08-26 14:35:28

相似文献

外文文献
中文文献
专利

1. Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks [J] . Ziyi Liu, Le Wang, Gang Hua, IEEE Transactions on Image Processing . 2018,第12期

机译：耦合动态马尔可夫网络的联合视频对象发现和分割
2. Dynamic Warping Network for Semantic Video Segmentation [J] . Jiangyun Li, Yikai Zhao, Xingjian He, Complexity . 2021,第a期

机译：语义视频分割动态翘曲网络
3. Variable segmentation based on intrinsic video rate characteristics to transport pre-stored video across networks [J] . Huirong Fu, Liren Zhang International journal of communication systems . 2003,第10期

机译：基于固有视频速率特征的可变分段，以跨网络传输预存储的视频
4. Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries [C] . Hao Wang, Cheng Deng, Fan Ma, AAAI Conference on Artificial Intelligence . 2020

机译：语言查询的Contor和Action视频分段的上下文调制动态网络
5. Efficient multi-view video coding scheme based on dynamic video object segmentation. [D] . Wei, Xiaohui. 2007

机译：基于动态视频对象分割的高效多视图视频编码方案。
6. Fully automatic segmentation of glottis and vocal folds in endoscopic laryngeal high-speed videos using a deep Convolutional LSTM Network [O] . Mona Kirstin Fehling, Fabian Grosch, Maria Elke Schuster, 2020

机译：使用深度卷积LSTM网络对喉镜内窥镜高速视频中的声门和声带进行全自动分割
7. Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries [O] . Hao Wang, Cheng Deng, Fan Ma, 2020

机译：语言查询的演员和动作视频分段的上下文调制动态网络

Dynamic Video Segmentation Network

摘要

著录项

相似文献

相关主题

期刊订阅