...
首页> 外文期刊>IEEE Transactions on Computers >High-Performance Designs for Linear Algebra Operations on Reconfigurable Hardware
【24h】

High-Performance Designs for Linear Algebra Operations on Reconfigurable Hardware

机译:可重构硬件上线性代数运算的高性能设计

获取原文
获取原文并翻译 | 示例

摘要

Numerical linear algebra operations are key primitives in scientific computing. Performance optimizations of such operations have been extensively investigated. With the rapid advances in technology, hardware acceleration of linear algebra applications using FPGAs (Field Programmable Gate Arrays) has become feasible. In this paper, we propose FPGA-based designs for several basic linear algebra operations, including dot product, matrix-vector multiplication, matrix multiplication and matrix factorization. By identifying the parameters for each operation, we analyze the trade-offs and propose a high-performance design. In the implementations of the designs, the values of the parameters are determined according to the hardware constraints, such as the available chip area, the size of available memory, the memory bandwidth, and the number of I/O pins. The proposed designs are implemented on Xilinx Virtex-II Pro FPGAs. Experimental results show that our designs scale with the available hardware resources. Also, the performance of our designs compares favorably with that of general-purpose processor based designs. We also show that with faster floating-point units and larger devices, the performance of our designs increases accordingly.
机译:数值线性代数运算是科学计算中的关键原语。此类操作的性能优化已得到广泛研究。随着技术的飞速发展,使用FPGA(现场可编程门阵列)的线性代数应用的硬件加速已变得可行。在本文中,我们为几种基本的线性代数运算提出了基于FPGA的设计,包括点积,矩阵向量乘法,矩阵乘法和矩阵分解。通过确定每个操作的参数,我们分析了取舍并提出了高性能设计。在设计的实现中,参数的值是根据硬件约束来确定的,例如可用的芯片面积,可用的内存大小,内存带宽以及I / O引脚数。拟议的设计在Xilinx Virtex-II Pro FPGA上实现。实验结果表明,我们的设计可以根据可用的硬件资源进行扩展。而且,我们的设计性能与基于通用处理器的设计相比具有优势。我们还表明,使用更快的浮点单元和更大的设备,我们的设计性能会相应提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号