首页> 外文会议>Data Compression Conference >A Dual-Critic Reinforcement Learning Framework for Frame-Level Bit Allocation in HEVC/H.265
【24h】

A Dual-Critic Reinforcement Learning Framework for Frame-Level Bit Allocation in HEVC/H.265

机译:HEVC / H.265中帧级位分配的双重批评加强学习框架

获取原文

摘要

This paper introduces a dual-critic reinforcement learning (RL) framework to address the problem of frame-level bit allocation in HEVC/H.265. The objective is to minimize the distortion of a group of pictures (GOP) under a rate constraint. Previous RL-based methods tackle such a constrained optimization problem by maximizing a single reward function that often combines a distortion and a rate reward. However, the way how these rewards are combined is usually ad hoc and may not generalize well to various coding conditions and video sequences. To overcome this issue, we adapt the deep deterministic policy gradient (DDPG) reinforcement learning algorithm for use with two critics, with one learning to predict the distortion reward and the other the rate reward. In particular, the distortion critic works to update the agent when the rate constraint is satisfied. By contrast, the rate critic makes the rate constraint a priority when the agent goes over the bit budget. Experimental results on commonly used datasets show that our method outperforms the bit allocation scheme in x265 and the single-critic baseline by a significant margin in terms of rate-distortion performance while offering fairly precise rate control.
机译:本文介绍了双重批评强化学习(RL)框架,用于解决HEVC / H.265中的帧级位分配问题。目标是在速率约束下最小化一组图片(GOP)的失真。以前的基于RL的方法通过最大化通常结合失真和速率奖励的单个奖励​​函数来解决这种受约束的优化问题。然而,如何组合这些奖励的方式通常是临时ad hoc,并且可能对各种编码条件和视频序列概括。为了克服这个问题,我们适应了与两个批评者一起使用的深度确定性政策梯度(DDPG)加强学习算法,其中一个学习预测失真奖励和其他速率奖励。特别是,在满足速率约束时,失真批评者会用于更新代理。相比之下,速率批评者使得速率约束在代理商超过位预算时优先考虑。常用数据集上的实验结果表明,我们的方法在速率 - 失真性能方面,在X265中的比特分配方案和单一评分基线的比分分配方案在提供了相当精确的速率控制的同时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号