首页> 外文期刊>Journal of visual communication & image representation >Reinforcement learning based coding unit early termination algorithm for high efficiency video coding
【24h】

Reinforcement learning based coding unit early termination algorithm for high efficiency video coding

机译:基于加强学习的编码单元高效视频编码的早期终止算法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we propose a Reinforcement Learning (RL) based Coding Unit (CU) early termination algorithm for High Efficiency Video Coding (HEVC). RL is utilized to learn a CU early termination classifier independent of depths for low complexity video coding. Firstly, we model the process of CU decision as a Markov Decision Process (MDP) according to the Markov property of CU decision. Secondly, based on the MDP, a CU early termination classifier independent of depths is learned from trajectories of CU decision across different depths with the end-to-end actor-critic RL algorithm. Finally, a CU decision early termination algorithm is introduced with the learned classifier, so as to reduce computational complexity of CU decision. We implement the proposed scheme with different neural network structures. Two different neural network structures are utilized in the implementation of RL based video encoder, which are evaluated to reduce video coding complexity by 34.34% and 43.33%. With regard to Bjontegaard delta peak signal-to-noise ratio and Bjontegaard delta bit rate, the results are -0.033 dB and 0.85%, -0.099 dB and 2.56% respectively on average under low delay B main configuration, when compared with the HEVC test model version 16.5. (C) 2019 Elsevier Inc. All rights reserved.
机译:在本文中,我们提出了一种基于加强学习(RL)的编码单元(CU)早期终止算法,用于高效率视频编码(HEVC)。 RL用于学习CU早期终端分类器,独立于低复杂度视频编码的深度。首先,根据Cu决策的马尔可夫属性,将Cu决策的过程模拟Cu决策作为马尔可夫决策过程(MDP)。其次,基于MDP,从不同深度的CU决策的轨迹与端到端的演员 - 评论仪RL算法的轨迹学习了独立于深度的Cu早期终止分类器。最后,利用学习分类器引入了CU决策早期终止算法,以降低Cu决策的计算复杂性。我们实施了具有不同神经网络结构的提出方案。在基于RL的视频编码器的实现中,使用了两个不同的神经网络结构,从而评估了34.34%和43.33%的视频编码复杂度。关于Bjontegaard Delta峰值信噪比和Bjontegaard Delta比特率,结果分别为-0.033dB和0.85%,-0.099dB和2.56%,与HEVC测试相比,在低延迟B主要配置下平均值平均值。型号16.5。 (c)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号