首页> 外文期刊>IEICE Transactions on fundamentals of electronics, communications & computer sciences >A 48 Cycles/MB H.264/AVC Deblocking Filter Architecture for Ultra High Definition Applications
【24h】

A 48 Cycles/MB H.264/AVC Deblocking Filter Architecture for Ultra High Definition Applications

机译:面向超高清应用的 48 周期/MB H.264/AVC 去块滤波器架构

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, a highly parallel deblocking filter architecture for H.264/AVC is proposed to process one macroblock in 48 clock cycles and give real-time support to QFHD@60 fps sequences at less than 100 MHz. 4 edge filters organized in 2 groups for simultaneously processing vertical and horizontal edges are applied in this architecture to enhance its throughput. While parallelism increases, pipeline hazards arise owing to the latency of edge filters and data dependency of deblocking algorithm. To solve this problem, a zig-zag processing schedule is proposed to eliminate the pipeline bubbles. Data path of the architecture is then derived according to the processing schedule and optimized through data flow merging, so as to minimize the cost of logic and internal buffer. Meanwhile, the architecture's data input rate is designed to be identical to its throughput, while the transmission order of input data can also match the zig-zag processing schedule. Therefore no intercommunication buffer is required between the deblocking filter and its previous component for speed matching or data reordering. As a result, only one 24×64 two-port SRAM as internal buffer is required in this design. When synthesized with SMIC 130nm process, the architecture costs a gate count of 30.2 k, which is competitive considering its high performance.
机译:该文提出一种针对H.264/AVC的高度并行解块滤波器架构,以48个时钟周期处理一个宏块,并实时支持小于100 MHz的QFHD@60 fps序列。随着并行度的增加,由于边缘过滤器的延迟和解块算法的数据依赖性,管道危险也随之而来。为了解决这个问题,提出了一种锯齿形处理方案来消除管道气泡。然后根据处理计划推导架构的数据路径,并通过数据流合并进行优化,从而最大限度地降低逻辑和内部缓冲器的成本。同时,该架构的数据输入速率被设计为与其吞吐量相同,而输入数据的传输顺序也可以与之字形处理计划相匹配。因此,解块滤波器与其前一个组件之间不需要相互通信缓冲区即可进行速度匹配或数据重新排序。因此,此设计中只需要一个 24×64 双端口 SRAM 作为内部缓冲器。采用中芯国际130nm工艺合成时,该架构的栅极数为30.2 k,考虑到其高性能,具有竞争力。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号