首页> 外文会议>VLSI Circuits and Systems II pt.2 >Parallel-Pipeline 2-D DCT/IDCT Processor Chip
【24h】

Parallel-Pipeline 2-D DCT/IDCT Processor Chip

机译:并行管道二维DCT / IDCT处理器芯片

获取原文
获取原文并翻译 | 示例

摘要

This paper describes the architecture of an 8x8 2-D DCT/IDCT processor with high throughput and a cost-effective architecture. The 2D DCT/IDCT is calculated using the separability property, so that its architecture is made up of two 1-D processors and a transpose buffer (TB) as intermediate memory. This transpose buffer presents a regular structure based on D-type flip-flops with a double serial input/output data-flow very adequate for pipeline architectures. The processor has been designed with parallel and pipeline architecture to attain high throughput, reduced hardware and maximum efficiency in all arithmetic elements. This architecture allows that the processing elements and arithmetic units work in parallel at half the frequency of the data input rate, except for normalization of transform which it is done in a multiplier operating at maximum frequency. Moreover, it has been verified that the precision analysis of the proposed processor meets the demands of IEEE Std. 1180-1990 used in video codecs ITU-T H.261 and ITU-T H.263. This processor has been conceived using a standard cell design methodology and manufactured in a 0.35-μm CMOS CSD 3M/2P 3.3V process. It has an area of 6.25 mm~2 (the core is 3mm~2)and contains a total of 11.7k gates, of which 5.8k gates are flip-flops. A data input rate frequency of 300MHz has been established with a latency of 172 cycles for the 2-D DCT and 178 cycles for the 2-D IDCT. The computing time of a block is close to 580ns. Its performances in computing speed as well as hardware complexity indicate that the proposed design is suitable for HDTV applications.
机译:本文介绍了具有高吞吐量和具有成本效益的架构的8x8 2-D DCT / IDCT处理器的架构。 2D DCT / IDCT使用可分离性属性进行计算,因此其架构由两个1-D处理器和一个作为中间存储器的转置缓冲区(TB)组成。该转置缓冲器提供了基于D型触发器的常规结构,并具有非常适合流水线体系结构的双串行输入/输出数据流。该处理器采用并行和流水线架构设计,可在所有算术元素中实现高吞吐量,减少硬件并实现最高效率。这种结构允许处理元件和算术单元以数据输入速率一半的频率并行工作,除了在以最大频率工作的乘法器中完成变换的归一化之外。此外,已经证实所提出的处理器的精度分析满足IEEE Std的要求。 1180-1990用于视频编解码器ITU-T H.261和ITU-T H.263。该处理器采用标准单元设计方法构想而成,并以0.35-μmCMOS CSD 3M / 2P 3.3V工艺制造。它的面积为6.25 mm〜2(核心为3mm〜2),总共包含11.7k个门,其中5.8k个门是触发器。已经建立了300MHz的数据输入速率频率,其中2-D DCT的延迟为172个周期,而2-D IDCT的延迟为178个周期。一个块的计算时间接近580ns。它在计算速度以及硬件复杂性方面的性能表明,该设计适合HDTV应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号