
A Dual-Critic Reinforcement Learning Framework for Frame-Level Bit Allocation in HEVC/H.265

Abstract

This paper introduces a dual-critic reinforcement learning (RL) framework to address the problem of frame-level bit allocation in HEVC/H.265. The objective is to minimize the distortion of a group of pictures (GOP) under a rate constraint. Previous RL-based methods tackle such a constrained optimization problem by maximizing a single reward function that often combines a distortion reward and a rate reward. However, the way these rewards are combined is usually ad hoc and may not generalize well to various coding conditions and video sequences. To overcome this issue, we adapt the deep deterministic policy gradient (DDPG) reinforcement learning algorithm for use with two critics, one learning to predict the distortion reward and the other the rate reward. In particular, the distortion critic works to update the agent when the rate constraint is satisfied. By contrast, the rate critic makes the rate constraint a priority when the agent goes over the bit budget. Experimental results on commonly used datasets show that our method outperforms the bit allocation scheme in x265 and the single-critic baseline by a significant margin in terms of rate-distortion performance while offering fairly precise rate control.
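The switching rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `CriticEstimates`, `select_critic`, `actor_objective`, and the `tolerance` parameter are hypothetical, the critics are reduced to plain numbers instead of neural networks, and the granularity at which the paper applies the switch is not stated in the abstract.

```python
from dataclasses import dataclass


@dataclass
class CriticEstimates:
    """Hypothetical container for the two critics' value estimates of one action."""
    distortion_q: float  # predicted distortion reward (higher means less distortion)
    rate_q: float        # predicted rate reward (higher means closer to the budget)


def select_critic(gop_bits: float, bit_budget: float, tolerance: float = 0.0) -> str:
    """Choose which critic guides the actor update.

    Assumption: the rate constraint counts as satisfied when the bits spent
    on the GOP do not exceed the budget by more than `tolerance` (a fraction).
    """
    if gop_bits <= bit_budget * (1.0 + tolerance):
        return "distortion"  # constraint met: focus on reducing distortion
    return "rate"            # over budget: prioritize the rate constraint


def actor_objective(est: CriticEstimates, gop_bits: float, bit_budget: float) -> float:
    """Return the critic value the actor would try to increase for this sample."""
    if select_critic(gop_bits, bit_budget) == "distortion":
        return est.distortion_q
    return est.rate_q


if __name__ == "__main__":
    est = CriticEstimates(distortion_q=-3.2, rate_q=-0.7)
    print(select_critic(gop_bits=95_000, bit_budget=100_000))
    print(select_critic(gop_bits=120_000, bit_budget=100_000))
    print(actor_objective(est, gop_bits=120_000, bit_budget=100_000))
```

Keeping the two rewards in separate critics means no hand-tuned weight is needed to merge distortion and rate into one scalar, which is the ad hoc step the abstract attributes to single-critic methods.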

Bibliographic details

Source
Data Compression Conference, 2021, pp. 13-22 (10 pages)
Authors
Yung-Han Ho; Guo-Lun Jin; Yun Liang; Wen-Hsiao Peng; Xiaobo Li
Original format
PDF
Keywords
Training; Bit rate; Video sequences; Rate-distortion; Reinforcement learning; Rate distortion theory; Distortion

Similar literature

1. Efficient frame-level bit allocation algorithm for H.265/HEVC [J]. Jing He, Fuzheng Yang. Image Processing, IET. 2017, No. 4
2. Frame-level Bit Allocation Optimization Based on Video Content Characteristics for HEVC [J]. ZHAOQING PAN, XIAOKAI YI, YUN ZHANG. ACM Transactions on Multimedia Computing, Communications, and Applications. 2020, No. 1
3. DCT Coefficient Distribution Modeling and Quality Dependency Analysis Based Frame-Level Bit Allocation for HEVC [J]. Gao Wei, Kwong Sam, Yuan Hui. Circuits and Systems for Video Technology, IEEE Transactions on. 2016, No. 1
4. Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation [C]. Lian-Ching Chen, Jun-Hao Hu, Wen-Hsiao Peng. 2018 IEEE 23rd International Conference on Digital Signal Processing. 2018
5. A Reinforcement Learning-based Framework for Resource Allocation and Task Assignment in Mobile Edge Computing Networks [D]. Hsieh, Li-Tse. 2021
6. Low-Complexity and Hardware-Friendly H.265/HEVC Encoder for Vehicular Ad-Hoc Networks [O]. Xiantao Jiang, Jie Feng, Tian Song. 2019
7. A Dual-Critic Reinforcement Learning Framework for Frame-Level Bit Allocation in HEVC/H.265 [O]. Yung-Han Ho, Guo-Lun Jin, Yun Liang. 2021
