首页> 外文会议> >An integrated and automated memory optimization flow for FPGA behavioral synthesis
【24h】

An integrated and automated memory optimization flow for FPGA behavioral synthesis

机译:用于FPGA行为综合的集成式自动存储器优化流程

获取原文
获取原文并翻译 | 示例

摘要

Behavioral synthesis tools have made significant progress in compiling high-level programs into register-transfer level (RTL) specifications. But manually rewriting code is still necessary in order to obtain better quality of results in memory system optimization. In recent years different automated memory optimization techniques have been proposed and implemented, such as data reuse and memory partitioning, but the problem of integrating these techniques into an applicable flow to obtain a better performance has become a challenge. In this paper we integrate data reuse, loop pipelining, memory partitioning, and memory merging into an automated optimization flow (AMO) for FPGA behavioral synthesis. We develop memory padding to help in the memory partitioning of indices with modulo operations. Experimental results on Xilinx Virtex-6 FPGAs show that our integrated approach can gain an average 5.8× throughput and 4.55× latency improvement compared to the approach without memory partitioning. Moreover, memory merging saves up to 44.32% of block RAM (BRAM).
机译:行为综​​合工具在将高级程序编译为寄存器传输级(RTL)规范方面取得了重大进展。但是仍然需要手动重写代码,以便在内存系统优化中获得更好的结果质量。近年来,已经提出并实现了不同的自动内存优化技术,例如数据重用和内存分区,但是将这些技术集成到适用的流程中以获得更好的性能的问题已成为一个挑战。在本文中,我们将数据重用,循环流水线,内存分区和内存合并集成到用于FPGA行为综合的自动优化流程(AMO)中。我们开发内存填充以通过模运算帮助索引的内存分区。在Xilinx Virtex-6 FPGA上的实验结果表明,与没有内存分区的方法相比,我们的集成方法平均可以获得5.8倍的吞吐量和4.55倍的延迟改进。此外,内存合并最多可节省44.32%的Block RAM(BRAM)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号