首页> 外文会议>Programmable Logic (SPL), 2012 VIII Southern Conference on >Low cost and high throughput multiplierless design of a 16 point 1-D DCT of the new HEVC video coding standard
【24h】

Low cost and high throughput multiplierless design of a 16 point 1-D DCT of the new HEVC video coding standard

机译:新型HEVC视频编码标准的16点一维DCT的低成本,高吞吐量无乘法器设计

获取原文
获取原文并翻译 | 示例

摘要

This paper presents the hardware design of a 16-points 1-D DCT used in the emerging video coding standard HEVC — High Efficiency Video Coding. The 1-D DCT is used by the 16×16 2-D DCT of the HEVC standard. The transforms stage is one of the innovations proposed by HEVC, not only because of the variable size (from 4×4 to 32×32) but also because higher dimension transforms other than the traditional 4×4 and 8×8 are used. The hardware design presented in this work focuses on low cost and high throughput. To achieve such objectives, the 16-points algorithm from HEVC was simplified, so that a more efficient hardware design could be implemented. Some strategies were used during this simplification, such as operations reordering, factoring to compress the length of the operators, multiplications by constant turned into shifts and adds, sub-expressions sharing, among others. The architecture was designed in a fully combinational way in order to reduce hardware overhead. Synthesis results obtained using Altera FPGAs from the Cyclone II and Stratix III families showed hardware resources reduction reaching 72% when compared to an architecture described as a direct transcription of the non-optimized version of the algorithm. Even with a purely combinational implementation, the designed architecture achieved a throughput between 376Msamples/s and 1.4Gsamples/s. With these results, the architecture is capable of processing, in the worst case, more than 30 QFHD frames (3840×2160 pixels) per second. Therefore, the architecture is capable of processing videos with significantly high resolutions in real time. To the best of our knowledge, this is the first work in the literature that presents hardware results for the HEVC transforms.
机译:本文介绍了新兴视频编码标准HEVC(高效视频编码)中使用的16点一维DCT的硬件设计。 HEVC标准的16×16 2-D DCT使用1-D DCT。变换阶段是HEVC提出的创新之一,不仅因为大小可变(从4×4到32×32),而且还因为使用了除传统4×4和8×8之外的更高维度的变换。本工作中介绍的硬件设计侧重于低成本和高吞吐量。为了实现这些目标,简化了HEVC的16点算法,从而可以实现更高效的硬件设计。在此简化过程中使用了一些策略,例如操作重新排序,分解运算符长度的因数,通过常数的乘法转换成移位和加法,共享子表达式等。该架构以完全组合的方式设计,以减少硬件开销。使用从Cyclone II和Stratix III系列的Altera FPGA获得的综合结果表明,与描述为该算法的非优化版本的直接转录的体系结构相比,硬件资源减少了72%。即使采用纯粹的组合实现,所设计的体系结构也可以实现376Msamples / s至1.4Gsamples / s的吞吐量。有了这些结果,该架构能够在最坏的情况下每秒处理30多个QFHD帧(3840×2160像素)。因此,该架构能够实时处理高分辨率的视频。据我们所知,这是文献中的第一篇工作,介绍了HEVC转换的硬件结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号