首页> 外文期刊>IEICE Transactions on fundamentals of electronics, communications & computer sciences >Architecture and Circuit Optimization of Hardwired Integer Motion Estimation Engine for H.264/AVC
【24h】

Architecture and Circuit Optimization of Hardwired Integer Motion Estimation Engine for H.264/AVC

机译:H.264 / AVC硬连线整数运动估计引擎的体系结构和电路优化

获取原文
获取原文并翻译 | 示例

摘要

Variable block size motion estimation developed by the latest video coding standard H.264/AVC is the efficient approach to reduce the temporal redundancies. The intensive computational complexity coming from the variable block size technique makes the hardwired accelerator essential, for real-time applications. Propagate partial sums of absolute differences (Propagate Partial SAD) and SAD Tree hardwired engines outperform other counterparts, especially considering the impact of supporting variable block size technique. In this paper, the authors apply the architecture-level and the circuit-level approaches to improve the maximum operating frequency and reduce the hardware overhead of Propagate Partial SAD and SAD Tree, while other metrics, in terms of latency, memory bandwidth and hardware utilization, of the original architectures are maintained. Experiments demonstrate that by using the proposed approaches, at 110.8 MHz operating frequency, compared with the original architectures, 14.7% and 18.0% gate count can be saved for Propagate Partial SAD and SAD Tree, respectively. With TSMC 0.18μm 1P6M CMOS technology, the proposed Propagate Partial SAD architecture achieves 231.6 MHz operating frequency at a cost of 84.1 k gates. Correspondingly, the maximum work frequency of the optimized SAD Tree architecture is improved to 204.8 MHz, which is almost two times of the original one, while its hardware overhead is merely 88.5 k-gate.
机译:最新的视频编码标准H.264 / AVC开发的可变块大小运动估计是减少时间冗余的有效方法。可变块大小技术带来的大量计算复杂性使硬连线加速器对于实时应用至关重要。传播绝对差的部分和(Propagate Partial SAD)和SAD Tree硬连线引擎优于其他同类引擎,尤其是考虑到支持可变块大小技术的影响。在本文中,作者采用架构级别和电路级别的方法来提高最大工作频率并减少Propagate Partial SAD和SAD Tree的硬件开销,而其他指标则涉及延迟,内存带宽和硬件利用率。保留了原始架构的。实验表明,通过使用所提出的方法,在110.8 MHz的工作频率下,与原始体系结构相比,可以将Propagate Partial SAD和SAD Tree的门数分别节省14.7%和18.0%。借助台积电0.18μm1P6M CMOS技术,拟议的Propagate Partial SAD架构可实现231.6 MHz的工作频率,而栅极成本为84.1 k。相应地,优化后的SAD Tree体系结构的最大工作频率提高到204.8 MHz,几乎是原来的两倍,而其硬件开销仅为88.5 k门。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号