【24h】

Efficient Wavefront Parallel Processing for HEVC CABAC Decoding

机译:HEVC CABAC解码的高效波前并行处理

获取原文

摘要

Context-based Adaptive Binary Arithmetic Coding (CABAC) is the only compute-intensive task in the High Efficiency Video Coding (HEVC) Standard that does not contain significant data-level parallelism. As a result, it is often a throughput bottleneck for the overall decoding process, especially for high-quality videos. Consequently, the use of high-level parallelization techniques is inevitable to reach throughput requirements for CABAC decoding. Multiple high-level parallelization tools are specified in HEVC, amongst which Wavefront Parallel Processing (WPP) has only small losses in coding efficiency. However, it lacks in parallel efficiency due to a ramp-up and -down in active parallel threads within a frame. This is a serious problem for systems that cannot process multiple frames at the same time due to performance or memory constraints (e.g. mobile devices), and also for low-delay applications such as video conferencing. To address this issue, we present three improved WPP implementations for HEVC CABAC decoding. They differ in the granularity at which dependency checks are performed. The improvement comes from increased parallel efficiency of the WPP implementation while using the same number of threads as conventional WPP. The proposed implementations allow speedups up to $1.83 imes$ with very little implementation overhead.
机译:基于上下文的自适应二进制算术编码(CABAC)是高效视频编码(HEVC)标准中唯一不包含大量数据级并行性的计算密集型任务。结果,它通常是整个解码过程的吞吐量瓶颈,尤其是对于高质量视频而言。因此,不可避免地会使用高级并行化技术来满足CABAC解码的吞吐量要求。 HEVC中指定了多种高级并行化工具,其中波前并行处理(WPP)在编码效率方面的损失很小。但是,由于帧内活动并行线程的上升和下降,因此缺乏并行效率。对于由于性能或存储器限制而不能同时处理多个帧的系统(例如,移动设备)以及对于诸如视频会议之类的低延迟应用而言,这是一个严重的问题。为了解决这个问题,我们提出了三种用于HEVC CABAC解码的改进的WPP实现。它们在执行依赖性检查的粒度上有所不同。改进来自WPP实现的并行效率提高,同时使用了与常规WPP相同数量的线程。所提出的实现方案允许加速高达$ 1.83 \ times $,而实现开销却很小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号