Conference: 2011 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing

A efficient parallel deblocking filter based on GPU: Implementation and optimization


Abstract

The deblocking filter is one of the most time-consuming tasks in the H.264/AVC standard. Because of its data dependencies and frequent memory accesses, mapping the algorithm efficiently onto a massively parallel architecture is a difficult challenge. In this paper, a novel GPU-based parallel deblocking filter is proposed, which weakens the dependencies between macroblocks (MBs) by rearranging the order in which boundaries are filtered. We implemented the proposed algorithm on a GPU and optimized the program through three strategies: kernel combination, reuse of intermediate data, and optimized data representation. Experimental results show that the proposed parallel method supports real-time processing of 1080p video at a throughput of 450 fps. We also observed speedups of 3.78× and 16.68× for the fully optimized parallel deblocking filter compared with a two-core processor implementation and a state-of-the-art GPU-based implementation, respectively.
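To make the MB dependency problem concrete, the following is a minimal sketch of the conventional wavefront ordering, not the boundary rearrangement proposed in the paper, whose details the abstract does not give. It assumes each MB depends only on its left and top neighbours and that a 1080p frame is padded to whole 16×16 MBs; the names `mb_grid` and `wavefront_schedule` are hypothetical.

```python
def mb_grid(width, height, mb_size=16):
    """Return (columns, rows) of macroblocks, rounding up to whole MBs."""
    cols = -(-width // mb_size)   # ceiling division
    rows = -(-height // mb_size)
    return cols, rows


def wavefront_schedule(cols, rows):
    """Group MB coordinates (x, y) into waves along anti-diagonals x + y = k.

    Assumption (baseline illustration only): an MB may be filtered once its
    left neighbour (x-1, y) and top neighbour (x, y-1) are done, so all MBs
    on the same anti-diagonal are mutually independent.
    """
    waves = []
    for k in range(cols + rows - 1):
        x_min = max(0, k - rows + 1)
        x_max = min(k, cols - 1)
        waves.append([(x, k - x) for x in range(x_min, x_max + 1)])
    return waves


cols, rows = mb_grid(1920, 1080)
waves = wavefront_schedule(cols, rows)
total_mbs = sum(len(w) for w in waves)
widest = max(len(w) for w in waves)
print(cols, rows, len(waves), total_mbs, widest)
```

Under these assumptions a 1080p frame has 120 × 68 MBs, yet the baseline ordering needs 187 sequential waves and never exposes more than 68 independent MBs at once. That limited parallelism is the kind of constraint that weakening inter-MB dependencies aims to relax.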
