首页> 外文会议>International Conference on Field-Programmable Technology >Hardware Implementations with High Throughput, Low-Latency and Low-Area for Matrix Inversion
【24h】

Hardware Implementations with High Throughput, Low-Latency and Low-Area for Matrix Inversion

机译:具有高吞吐量,低延迟和低区域的硬件实现,用于矩阵反转

获取原文
获取外文期刊封面目录资料

摘要

This paper proposes two hardware architectures for matrix inversion, through the use of single and double-precision floating-point representations. The architectures use a modified version of the Gauss-Jordan algorithm that accelerates the processing of matrix inversion. The modified algorithm starts performing the normalization and elimination steps from the column that follows the pivot column instead of the first matrix column, which speeds up the process of the matrix inversion and hence achieves high performance. The first of the two hardware architectures is purely designed with registers, while memory blocks are required in the second architecture. The implementation results show that the modified version of the Gauss-Jordan algorithm has considerably improved the hardware performance of the matrix inversion, in terms of latency, throughput and hardware resources. The architectures have been optimized for Xilinx FPGAs and they are capable of operating at frequencies of 211.999 and 422.654 MHz in a Zynq xc7z045 FPGA.
机译:本文通过使用单一和双精度浮点表示提出了两个用于矩阵反转的硬件架构。该体系结构使用Gauss-jordan算法的修改版本,可以加速矩阵反转的处理。修改的算法开始从追随枢轴列而不是第一矩阵列的列执行归一化和消除步骤,从而加速了矩阵反转的过程,因此实现了高性能。这两个硬件架构中的第一个纯粹设计了寄存器,而在第二架构中需要内存块。实施结果表明,在延迟,吞吐量和硬件资源方面,高斯 - 乔丹算法的修改版本大大提高了矩阵反转的硬件性能。该架构已经针对Xilinx FPGA进行了优化,并且它们能够在Zynq XC7Z045 FPGA中以211.999和422.654 MHz的频率运行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号