Semantic video segmentation with dynamic keyframe selection and distortion-aware feature rectification

Awan Mehwish; Shin Jitae

Abstract

Per-frame segmentation methods have a high computational cost and therefore cannot meet the fast-inference requirements of semantic video segmentation. To reuse extracted features effectively through feature propagation, we present distortion-aware feature rectification and online keyframe selection for fast and accurate video segmentation. The proposed dynamic keyframe scheduling scheme uses reinforcement learning to select keyframes according to the extent of temporal variation. We employ a policy-gradient strategy to learn a policy function that maximizes the expected reward. The policy network has two actions in its action space: key and non-key. State information is derived from the element-wise difference between the current frame and the warped current frame generated by propagating the previous frame. For non-key frames, an adaptive partial feature rectification with distortion-aware corrections is then applied to the warped features. Precise feature propagation is critical for tracking temporal updates in the video sequence, since it strongly affects both the accuracy and the throughput of the whole video analysis framework. Guided by a distortion map, a lightweight feature extractor revises the distorted feature maps, while correctly propagated features are left unchanged. The deep feature flow approach is adopted for feature propagation. We evaluate our scheme on the Cityscapes and CamVid datasets, with DeepLabv3 as the segmentation network and LiteFlowNet for computing flow fields. Experimental results show that the proposed method significantly outperforms previous state-of-the-art methods in both accuracy and throughput. (c) 2021 Elsevier B.V. All rights reserved.
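
The abstract gives no implementation details, so the following is only a rough sketch of the three ideas it names: a state built from the element-wise frame difference, a two-action (key / non-key) policy trained with a policy-gradient update, and a distortion-guided blend of warped and corrected features. Everything here is an assumption made for illustration: the function names, the one-parameter logistic policy standing in for the policy network, the plain REINFORCE update, and the linear blend are hypothetical and are not taken from the paper.

```python
import numpy as np


def difference_state(current_frame, warped_frame):
    """Element-wise absolute difference, used here as the policy's state."""
    cur = np.asarray(current_frame, dtype=np.float32)
    warped = np.asarray(warped_frame, dtype=np.float32)
    return np.abs(cur - warped)


def key_probability(state, weight, bias):
    """Hypothetical stand-in for the policy network: P(action = key | state)."""
    logit = weight * float(state.mean()) + bias
    return 1.0 / (1.0 + np.exp(-logit))


def reinforce_step(weight, bias, state, action, reward, lr=0.1):
    """One REINFORCE update for the Bernoulli policy above.

    action is 1 (key) or 0 (non-key). For a sigmoid policy, the gradient of
    log pi(action) with respect to the logit is (action - p).
    """
    p = key_probability(state, weight, bias)
    grad_logit = action - p
    x = float(state.mean())
    new_weight = weight + lr * reward * grad_logit * x
    new_bias = bias + lr * reward * grad_logit
    return new_weight, new_bias


def rectify(warped_feat, corrected_feat, distortion_map):
    """Assumed linear blend: keep warped features where the distortion map
    is 0 and use the corrected features where it is 1."""
    d = np.clip(distortion_map, 0.0, 1.0)
    return (1.0 - d) * warped_feat + d * corrected_feat
```

In this sketch a large mean difference between the current frame and its warped estimate raises the probability of choosing the key action, and `rectify` touches only the locations flagged by the distortion map, which mirrors the abstract's statement that correctly propagated features are not influenced.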

Bibliographic details

Source
Image and Vision Computing | 2021, Issue 6 | 104184.1-104184.11 | 11 pages
Authors
Awan Mehwish; Shin Jitae;
Affiliations

Sungkyunkwan Univ Dept Elect & Comp Engn Suwon 16419 South Korea;

Sungkyunkwan Univ Coll Informat & Commun Engn Suwon 16419 South Korea;

Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
Original format: PDF
Language: English
Keywords
Semantic video segmentation; Feature warping; Distortion-aware feature correction; Policy network; Dynamic keyframe selection scheme; Reinforcement learning; Deep learning;

Similar literature

1. Information-theoretic temporal segmentation of video and applications: multiscale keyframes selection and shot boundaries detection [J]. Bruno Janvier, Eric Bruno, Thierry Pun. Multimedia Tools and Applications. 2006, Issue 3.
2. Robust visual-inertial odometry in dynamic environments using semantic segmentation for feature selection [J]. P. Irmisch, D. Baumbach, I. Ernst. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2020, Issue 5.
3. Video semantic segmentation via feature propagation with holistic attention [J]. Pattern Recognition: The Journal of the Pattern Recognition Society. 2020.
4. Online Keyframe Selection Scheme for Semantic Video Segmentation [C]. Mehwish Awan, Jitae Shin. IEEE International Conference on Consumer Electronics - Asia. 2020.
5. Statistical feature selection and extraction for video and image segmentation [D]. Song, Xiaomu. 2005.
6. A Computer-Aided Diagnosis System for Dynamic Contrast-Enhanced MR Images Based on Level Set Segmentation and ReliefF Feature Selection [O]. Zhiyong Pang, Dongmei Zhu, Dihu Chen. 2015.
7. Information-theoretic temporal segmentation of videos and applications: multiscale keyframe selection and transition detection [O]. Janvier, Bruno; Bruno, Eric; Marchand-Maillet, Stéphane. 2006.
