首页> 外文期刊>IEEE Transactions on Signal Processing >Novel Memory Reference Reduction Methods for FFT Implementations on DSP Processors
【24h】

Novel Memory Reference Reduction Methods for FFT Implementations on DSP Processors

机译:在DSP处理器上实现FFT的新颖的存储器参考缩减方法

获取原文
获取原文并翻译 | 示例

摘要

Memory references in digital signal processors (DSP) are expensive due to their long latencies and high power consumption. Implementing fast Fourier transform (FFT) algorithms on DSP involves many memory references to access butterfly inputs and twiddle factors. Conventional FFT implementations require redundant memory references to load identical twiddle factors for butterflies from different stages in the FFT diagrams. In this paper, we present novel memory reference reduction methods to minimize memory references due to twiddle factors for implementing various different FFT algorithms on DSP. The proposed methods first group the butterflies with identical twiddle factors from different stages in the FFT diagrams and compute them before computing other butterflies with different twiddle factors, and then reduce the number of twiddle factor lookups by taking advantage of the properties of twiddle factors. Consequently, each twiddle factor is loaded only once and the number of memory references due to twiddle factors can be minimized. We have applied the proposed methods to implement radix-2 DIF FFT algorithm on TI TMS320C64x DSP. Experimental results show the proposed methods can achieve average of 76.4% reduction in the number of memory references, 53.5% saving of memory spaces due to twiddle factors, and average of 36.5% reduction in the number of clock cycles to compute radix-2 DIF FFT on DSP comparing to the conventional implementation. Similar performance gain is reported for implementing radix-2 DIT FFT algorithms using the new methods.
机译:数字信号处理器(DSP)中的内存引用由于延迟时间长和功耗高而价格昂贵。在DSP上实现快速傅立叶变换(FFT)算法涉及许多内存引用,以访问蝶形输入和旋转因子。传统的FFT实现需要冗余的存储器引用,以加载FFT图中来自不同阶段的蝶形的相同旋转因子。在本文中,我们提出了新颖的内存引用减少方法,以最大程度地减少由于旋转因素而在DSP上实现各种不同FFT算法的内存引用。所提出的方法首先将FFT图中来自不同阶段的具有相同旋转因子的蝴蝶进行分组,然后在计算具有不同旋转因子的其他蝴蝶之前对其进行计算,然后通过利用旋转因子的性质来减少旋转因子查找的次数。因此,每个旋转因子仅被加载一次,并且由于旋转因子而导致的存储器引用的数量可以最小化。我们已将提出的方法应用于在TI TMS320C64x DSP上实现radix-2 DIF FFT算法。实验结果表明,所提出的方法可以平均减少76.4%的内存引用数量,由于旋转因素而节省53.5%的内存空间,并且平均可以减少计算radix-2 DIF FFT的时钟周期数量的36.5%。在DSP上与传统实现相比。据报道,使用新方法实现radix-2 DIT FFT算法具有类似的性能增益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号