首页> 外文会议>IEEE Symposium on Field-Programmable Custom Computing Machines >Fine-tuning Loop-level Parallelism for Increasing Performance of DSP Applications on FPGAs
【24h】

Fine-tuning Loop-level Parallelism for Increasing Performance of DSP Applications on FPGAs

机译:微调环级并行性,用于提高DSP应用对FPGA的性能

获取原文

摘要

This paper discusses the balance between loop-level parallelism and clock rate for enhancing the performance of DSP applications fully implemented on FPGAs. Loop-level parallelism reduces the total cycles of an application at the cost of increased routing complexity that often results in lower clock rates. We analyze loops that can be fully parallelized and show that it is possible to achieve better performance by controlling the number of parallel iterations of the loops than using fully parallel loops. We have implemented loop parallelism in our compilation framework and fine-tune them to enhance the performance of DSP applications that target Xilinx Virtex-II FPGA chip. Our experimental results show that it is possible to reach a performance equilibrium point where the total number of cycles and the overall clock frequency can be adjusted to maximize the overall performance of an application.
机译:本文讨论了环路平行度和时钟速率之间的平衡,以提高在FPGA上完全实现的DSP应用程序的性能。循环级并行性降低了应用程序的总循环,以增加的路由复杂性,通常会导致较低的时钟速率。我们分析了可以完全并行化的循环,并表明可以通过控制循环的并行迭代的数量来实现更好的性能,而不是使用完全并行环路。我们在编译框架中实现了循环并行性,并进行微调,以增强目标Xilinx Virtex-II FPGA芯片的DSP应用的性能。我们的实验结果表明,可以调整循环总数和整体时钟频率的总数达到性能平衡点,以最大限度地提高应用的整体性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号