
An efficient implementation of FFT based on CGRA


Abstract

This paper presents an efficient implementation of the complex FFT algorithm on REMUS-II_MB, a CGRA-based reconfigurable architecture. The implementation is divided into two steps. In the first step, the local sequential stages are performed independently on the RCAs; in the second step, the cross parallel stages, which require data communication, are processed. Performance is improved by two techniques: pipeline bubble elimination and data block location rearrangement. Compared with other parallel FFT implementations, the proposed implementation on REMUS-II_MB achieves a performance advantage of 1.15 to 12.6 times with little local memory cost.
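The split between local and cross stages can be illustrated with a minimal software sketch. It assumes a radix-2 decimation-in-time FFT on bit-reversed input, with the data divided into contiguous blocks, one per RCA. The abstract confirms none of these details, and the names `two_step_fft`, `butterfly_stage`, and `num_blocks` are hypothetical. The sketch models only which stages stay inside a block and which pair elements across blocks; it does not model REMUS-II_MB, pipeline bubble elimination, or data block location rearrangement.

```python
import cmath


def bit_reverse_permute(x):
    """Return a copy of x reordered by bit-reversed index (len(x) is a power of two)."""
    n = len(x)
    bits = n.bit_length() - 1
    out = [0j] * n
    for i in range(n):
        rev = 0
        for b in range(bits):
            if (i >> b) & 1:
                rev |= 1 << (bits - 1 - b)
        out[rev] = x[i]
    return out


def butterfly_stage(data, half, lo, hi):
    """Apply one radix-2 DIT stage with butterfly span `half` to data[lo:hi], in place."""
    size = 2 * half
    for start in range(lo, hi, size):
        for k in range(half):
            w = cmath.exp(-2j * cmath.pi * k / size)  # twiddle factor
            a = data[start + k]
            b = data[start + k + half] * w
            data[start + k] = a + b
            data[start + k + half] = a - b


def two_step_fft(x, num_blocks):
    """FFT of x, organized as local stages per block followed by cross-block stages.

    Assumption: each block stands in for the data held by one RCA.
    """
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("len(x) must be a power of two")
    if num_blocks <= 0 or num_blocks & (num_blocks - 1) or num_blocks > n:
        raise ValueError("num_blocks must be a power of two not exceeding len(x)")

    block = n // num_blocks
    data = bit_reverse_permute(x)

    # Step 1: stages whose butterfly span is smaller than a block touch only
    # one block, so every block can be processed independently.
    for b in range(num_blocks):
        lo = b * block
        half = 1
        while half < block:
            butterfly_stage(data, half, lo, lo + block)
            half *= 2

    # Step 2: the remaining log2(num_blocks) stages pair elements from
    # different blocks, which is where inter-block communication is needed.
    half = block
    while half < n:
        butterfly_stage(data, half, 0, n)
        half *= 2

    return data
```

For example, `two_step_fft(samples, num_blocks=4)` on a 16-point input runs two local stages inside each 4-point block and then two cross stages. The result does not depend on `num_blocks`; only the division of work between the two steps changes.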
