首页> 外文会议>2012 Third International Conference on Networking and Computing. >Towards a Low-Power Accelerator of Many FPGAs for Stencil Computations
【24h】

Towards a Low-Power Accelerator of Many FPGAs for Stencil Computations

机译:向许多用于模版计算的FPGA的低功耗加速器迈进

获取原文
获取原文并翻译 | 示例

摘要

We have proposed the effective stencil computation method and the architecture by employing multiple small FPGAs with 2D-mech topology. In this paper, we show that our proposed architecture works correctly on the real 2D-mesh connected FPGA array. We developed a software simulator in C++, which emulates our proposed architecture, and implemented two prototype systems in Verilog HDL. One prototype system is for logic verification with communication modules and the other is for estimation of power consumption without communication modules. We run the former prototype system for 2M cycles and check the behavior with the software simulator. Our architecture is developed towards a low-power accelerator of many FPGAs. The evaluation result with the second prototype shows that the system of a single FPGA node with eight floating-point adders and eight floating-point multipliers archives 2.24GFlop/s in 0.16GHz operations with 2.37W power consumption. This performance/W value is about six-times better than NVidia GTX280 GPU card.
机译:我们已经提出了一种有效的模板计算方法和架构,它是通过使用多个具有二维机电拓扑的小型FPGA来实现的。在本文中,我们证明了我们提出的架构可在真正的2D网格连接的FPGA阵列上正常工作。我们用C ++开发了一个软件模拟器,可以模拟我们提出的体系结构,并在Verilog HDL中实现了两个原型系统。一个原型系统用于带有通信模块的逻辑验证,而另一个原型系统用于在没有通信模块的情况下估计功耗。我们将以前的原型系统运行2M个周期,并使用软件模拟器检查行为。我们的架构是针对许多FPGA的低功耗加速器而开发的。第二个原型的评估结果表明,具有8个浮点加法器和8个浮点乘法器的单个FPGA节点的系统在0.16GHz的操作中以2.37W的功耗归档2.24GFlop / s。该性能/ W值约为NVidia GTX280 GPU卡的六倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号