...
首页> 外文期刊>Circuits and Systems II: Express Briefs, IEEE Transactions on >A Memory-Efficient Scalable Architecture for Lifting-Based Discrete Wavelet Transform
【24h】

A Memory-Efficient Scalable Architecture for Lifting-Based Discrete Wavelet Transform

机译:基于提升的离散小波变换的内存高效可扩展架构

获取原文
获取原文并翻译 | 示例
           

摘要

In this brief, we propose a new parallel lifting-based 2-D DWT architecture with high memory efficiency and short critical path. The memory efficiency is achieved with a novel scanning method that enables tradeoff of external memory bandwidth and on-chip memory. Based on the data flow graph of the flipped lifting algorithm, processing units (PUs) are developed for maximally utilizing the inherent parallelism. With S number of PUs, the throughput can be scaled while keeping the latency constant. Compared with the best existing architecture, the proposed architecture requires less memory. For an N × N image, the proposed architecture consumes a total of only 3N + 24S words of transposition memory, temporal memory, and pipeline registers. The synthesized results in a 90-nm CMOS process show that it achieves better area-delay products than the best existing design by 32.3%, 31.5%, and 27.0% when S = 2, 4, and 8, respectively, and by 26%, 26%, and 22% when the overhead for buffering the required overlapped pixels is taken into account.
机译:在本简介中,我们提出了一种新的基于并行提升的二维DWT结构,该结构具有高存储效率和短关键路径。通过一种新颖的扫描方法可以实现存储效率,该扫描方法可以权衡外部存储器带宽和片上存储器。基于翻转提升算法的数据流图,开发了处理单元(PU),以最大程度地利用固有的并行性。使用S个PU,可以在保持等待时间恒定的同时缩放吞吐量。与最佳的现有体系结构相比,提出的体系结构需要更少的内存。对于N×N图像,所提出的体系结构仅消耗3N + 24S字的换位存储器,时间存储器和流水线寄存器。在90纳米CMOS工艺中的综合结果表明,当S = 2、4和8时,与最佳的现有设计相比,它可以获得比最佳现有设计更好的面积延迟产品,分别为32.3%,31.5%和27.0%,以及26%。 ,当考虑到用于缓冲所需的重叠像素的开销时,分别为26%和22%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号