首页> 外文会议>Canadian Conference on Electrical and Computer Engineering >A FAST, PIPELINED IMPLEMENTATION OF A TWO-DIMENSIONAL INVERSE DISCRETE COSINE TRANSFORM
【24h】

A FAST, PIPELINED IMPLEMENTATION OF A TWO-DIMENSIONAL INVERSE DISCRETE COSINE TRANSFORM

机译:快速,流水线实现二维反离散余弦变换

获取原文

摘要

The Inverse Discrete Cosine Transform (IDCT) is a significant component in today's JPEG and MPEG decoders. Of all the stages in the decoding process of a JPEG file, the IDCT is the most computationally intensive. Hence, we require fast and efficient implementations, either in software or hardware. Numerous individual designs for computing the 1D-IDCT have been proposed. Our 2D-IDCT incorporates two of our 1D-IDCT cores and a transpose network to provide a stall-free pipeline. In this paper, we describe a fast hardware implementation of a two-dimensional IDCT architecture that implements a variation of the modified Loeffler algorithm. This design is currently functionally verified, synthesized and tested on the Xilinx Virtex II FPGA. Our FPGA implementation has a throughput of over 800M coefficients per second, implemented as an eight-wide pipeline with a clock frequency of 102 MHz. We suggest ideas to parallelize the design and further enhance performance. We also describe an ASIC design of the HDL model that operates at a clock frequency of 154 MHz using TSMC's 0.18 μm CMOS technology. Our VHDL implementation will be released as "open source".
机译:逆离散余弦变换(IDCT)是当今JPEG和MPEG解码器中的重要组成部分。在JPEG文件的解码过程中的所有阶段中,IDCT是最具计算密集的。因此,我们需要快速高效的实现,无论是在软件还是硬件中。已经提出了许多用于计算1D-IDCT的单独设计。我们的2D-IDCT包含我们的两个1D-IDCT核和转位网络,以提供无摊位的管道。在本文中,我们描述了一种实现修改后的LOEFFLER算法的变型的二维IDCT架构的快速硬件实现。目前在Xilinx Virtex II FPGA上有效地验证,合成和测试了这种设计。我们的FPGA实施具有每秒超过800米系数的吞吐量,实现为八大管道,时钟频率为102 MHz。我们建议观念并将设计并行化并进一步提高性能。我们还描述了HDL模型的ASIC设计,其使用TSMC的0.18μmCMOS技术以154 MHz的时钟频率运行。我们的VHDL实现将被释放为“开源”。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号