...
首页> 外文期刊>Image and Vision Computing >Semantic video segmentation with dynamic keyframe selection and distortion-aware feature rectification
【24h】

Semantic video segmentation with dynamic keyframe selection and distortion-aware feature rectification

机译:具有动态关键帧选择和失真感知功能整流的语义视频分段

获取原文
获取原文并翻译 | 示例
           

摘要

The per-frame segmentation methods have a high computational cost, thereby, these methods are insufficient to cope with the fast inference need of semantic video segmentation. To efficaciously reuse the extracted features by feature propagation, in this paper, we present distortion-aware feature rectification and online selection of keyframes for fast and accurate video segmentation. The proposed dynamic keyframe scheduling scheme is based on the extent of temporal variations using reinforcement learning. We employ policy gradient reinforcement strategy to learn policy function for maximizing the expected reward. The policy network has two actions (key and non-key) in the action space. State information is derived from the element-wise difference frame of the current frame and the warped current frame generated by the propagated previous frame. Afterward, an adaptive partial feature rectification with distortion-aware corrections is performed for the warped features of the non-key frames. Precise feature propagation is a critical task to uphold the temporal updates in the video sequence since it enormously affects the accuracy as well as the throughput of the whole video analysis framework. The distorted feature maps are revised with the light-weight feature extractor by the guidance of the distortion map while the correctly propagated features are not influenced. Deep feature flow approach is adopted for feature propagation. We evaluate our scheme on the Cityscapes and CamVid datasets with DeepLabv3 as segmentation network and LiteFlowNet for computing flow fields. Experimental results show that the proposed method outperforms the previous state-of-the-art methods significantly both in terms of accuracy and throughput. (c) 2021 Elsevier B.V. All rights reserved.
机译:每个帧分割方法具有高计算成本,从而,这些方法不足以应对语义视频分段的快速推断。在本文中,为了通过特征传播进行有效地重用提取的特征,我们呈现了失真感知功能整流和在线选择关键帧,以实现快速和准确的视频分段。所提出的动态关键帧调度方案基于使用增强学习的时间变化程度。我们采用政策梯度强化策略来学习最大化预期奖励的政策功能。策略网络在动作空间中有两个操作(键和非键)。状态信息源自当前帧的元素 - 方向差异帧和由传播的先前帧生成的翘曲当前帧。之后,对非关键帧的翘曲特征执行具有失真感知校正的自适应部分特征整流。精确的特征传播是一个关键任务,以维护视频序列中的时间更新,因为它极大地影响了整个视频分析框架的准确性以及吞吐量。通过扭曲映射的指导,在正确的传播特征不影响的同时,使用轻量级特征提取器修改失真的特征映射。采用深度特征流法进行特征传播。我们在CityCAPES和CAMVID数据集中评估了DeePlabv3作为分段网络和LiteFlownet的Camvid数据集,用于计算流场。实验结果表明,该方法在准确性和吞吐量方面显着优于先前的最先进的方法。 (c)2021 elestvier b.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号