首页> 外文期刊>Journal of VLSI signal processing systems >FPGA Architecture for 2D Discrete Fourier Transform Based on 2D Decomposition for Large-sized Data
【24h】

FPGA Architecture for 2D Discrete Fourier Transform Based on 2D Decomposition for Large-sized Data

机译:基于二维分解的大数据二维离散傅里叶变换的FPGA体系结构

获取原文
获取原文并翻译 | 示例
           

摘要

Applications based on Discrete Fourier Transforms (DFT) are extensively used in several areas of signal and digital image processing. Of particular interest is the two-dimensional (2D) DFT which is more computation- and bandwidth-intensive than the one-dimensional (ID) DFT. Traditionally, a 2D DFT is computed using Row-Column (RC) decomposition, where ID DFTs are computed along the rows followed by ID DFTs along the columns. Both application specific and reconfigurable hardware have utilized this scheme for high-performance implementations of 2D DFT. However, architectures based on RC decomposition are not efficient for large input size data due to memory bandwidth constraints. In this paper, we propose an efficient architecture to implement 2D DFT for large-sized input data based on a novel 2D decomposition algorithm. This architecture achieves very high throughput by exploiting the inherent parallelism due to the algorithm decomposition and by utilizing the row-wise burst access pattern of the external memory. A high throughput memory interface has been designed to enable maximum utilization of the memory bandwidth. In addition, an automatic system generator is provided for mapping this architecture onto a reconfigurable platform of Xilinx Virtex-5 devices. For a 2K x 2K input size, the proposed architecture is 1.96 times faster than RC decomposition based implementation under the same memory constraints, and also outperforms other existing implementations.
机译:基于离散傅立叶变换(DFT)的应用广泛用于信号和数字图像处理的多个领域。尤其值得关注的是二维(2D)DFT,它比一维(ID)DFT的计算和带宽密集度更高。传统上,使用行列(RC)分解来计算2D DFT,其中沿着行计算ID DFT,然后沿着列计算ID DFT。专用硬件和可重新配置硬件都已将此方案用于2D DFT的高性能实现。但是,由于内存带宽限制,基于RC分解的体系结构对于大输入大小的数据效率不高。在本文中,我们提出了一种有效的体系结构,它基于一种新颖的2D分解算法为大型输入数据实现2D DFT。通过利用由于算法分解而产生的固有并行性并利用外部存储器的行式突发访问模式,该体系结构实现了非常高的吞吐量。高吞吐量内存接口已被设计为可以最大程度地利用内存带宽。此外,提供了一个自动系统生成器,用于将该体系结构映射到Xilinx Virtex-5设备的可重新配置平台上。对于2K x 2K的输入大小,在相同的内存约束下,所提出的体系结构比基于RC分解的实现快1.96倍,并且性能优于其他现有实现。

著录项

  • 来源
    《Journal of VLSI signal processing systems》 |2011年第1期|p.109-122|共14页
  • 作者单位

    School of Electrical, Computer and Energy Engineering,Arizona State University, Tempe, AZ, USA;

    Department of Computer Science and Engineering,Pennsylvania State University, University Park, PA, USA;

    School of Electrical, Computer and Energy Engineering,Arizona State University, Tempe, AZ, USA;

    Department of Computer Science and Engineering,Pennsylvania State University, University Park, PA, USA;

    Department of Computer Science and Engineering,Pennsylvania State University, University Park, PA, USA;

    School of Electrical, Computer and Energy Engineering,Arizona State University, Tempe, AZ, USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    multi-dimensional signal processing; dft; algorithm-architecture co-design;

    机译:多维信号处理算法架构协同设计;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号