A Comparison of FPGA Implementations of Bit-Level and Word-Level Matrix Multipliers

机译：位级和字级矩阵乘法器的FPGA实现比较

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We have implemented a novel bit-level matrix multiplier on a Xilinx FPGA chip where each processing element does a simple operation of adding three to six bits to generate one partial sum bit and one to two carryout bits. The speedup over word-level is possible because individual bits of a word do not have to be processed as a unit in a bit-level architecture. It is shown in a previous work that bit-level architectures for fixed point applications can be O(log p) times faster than the corresponding word-level architecture where ?is the word length. In this paper we implemented the bit-level matrix multiplier on a Xilinx FPGA chip that is compared to a word-level matrix multiplier composed of highly optimized multiplier and adder macros available in the Xilinx Core generator library. The architecture presented in this paper is even faster than previous ones by breaking the critical path in the dependence graph into half. Our results show that speedup by a factor of 2 can be obtained in practice.

机译：我们已经在Xilinx FPGA芯片上实现了一种新颖的位级矩阵乘法器，其中每个处理元件都执行一个简单的操作，即将三到六位相加以生成一个部分和位和一到两个进位位。在字级上进行加速是可能的，因为字的各个位不必在位级体系结构中作为一个单元进行处理。在先前的工作中表明，定点应用程序的位级体系结构可以比相应的字级体系结构快O（log p）倍，其中？是字长。在本文中，我们在Xilinx FPGA芯片上实现了位级矩阵乘法器，并将其与由Xilinx Core生成器库中提供的高度优化的乘法器和加法器宏组成的字级矩阵乘法器进行了比较。通过将依赖关系图中的关键路径分为两半，本文提出的体系结构比以前的体系结构甚至更快。我们的结果表明，在实践中可以将速度提高2倍。

著录项

来源
《10th International Conference on Field-Programmable Logic and Applications: The Roadmap to Reconfigurable Computing FPL 2000, 10th, Aug 27-30, 2000, Villach, Austria 》|2000年|p.422-431|共10页
会议地点 Villach(AT);Villach(AT)
作者
Radhika S. Grover; Weijia Shang; Qiang Li;
展开▼
作者单位

Department of Computer Engineering, Santa Clara University, Santa Clara, CA, U.S.A.;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术 ;
关键词

相似文献

外文文献
中文文献
专利

1. Performance Comparison and Design Implementation of Low Area Complex Matrix Multiplier Using FPGA Applied for MIMO Equalization in 5G Environments [J] . P. Prakash, M. Kannan Journal of computational and theoretical nanoscience . 2018 ,第4期

机译：低面积复杂矩阵乘法器的性能比较与设计实现使用FPGA在5G环境中应用MIMO均衡
2. High throughput VLSI implementation of discrete orthogonaltransforms using bit-level vector-matrix multiplier [J] . Nayak S.S., Meher P.K. IEEE Transactions on Circuits and Systems. II, Express Briefs . 1999 ,第5期

机译：使用位级矢量矩阵乘法器的离散正交变换的高吞吐量VLSI实现
3. High throughput VLSI implementation of discrete orthogonal transforms using bit-level vector-matrix multiplier [J] . Nayak S.S., Meher P.K. IEEE Transactions on Circuits and Systems. II . 1999 ,第5期

机译：使用位级矢量矩阵乘法器的离散正交变换的高吞吐量VLSI实现
4. A Comparison of FPGA Implementations of Bit-Level and Word-Level Matrix Multipliers [C] . Radhika S. Grover, Weijia Shang, Qiang Li International Conference on Field-Programmable Logic and Applications . 2000

机译：比特级与字级矩阵乘法器的FPGA实现的比较
5. A comparison of FPGA implementations of hybrid serial-parallel multiplier. [D] . Babu, Biju. 2010

机译：混合串行并行乘法器的FPGA实现比较。
6. Identification of Gram-Positive Cocci by Use of Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry: Comparison of Different Preparation Methods and Implementation of a Practical Algorithm for Routine Diagnostics [O] . Bettina Schulthess, Katharina Brodner, Guido V. Bloemberg, 2013

机译：基质辅助激光解吸电离-飞行时间质谱鉴定革兰氏阳性球菌：不同制备方法的比较和常规诊断实用算法的实现
7. FPGA Versus ASIC Implementation of Radix-8 Scalable Montgomery Modular Multiplier.(Dept.E) FPGA Versus ASIC Implementation of Radix-8 Scalable Montgomery Modular Multiplier (Dept.E) [O] . Ateef Ebrahim, Hamed ElSemary, Amen Nassar 2020

机译：FPGA与基数-8可扩展蒙哥马利模块化倍增器的ASIC实现。（DEPT.E）FPGA与ASIC实现的基数-8可扩展蒙格组合模块化倍增仪（DEPT.E）
8. Efficient Bit-Level, Word-Level, and Block-Level Systolic Arrays for Matrix-Matrix Multiplication [R] . De Groot, A. J. , Parker, S. R. , Johansson, E. M. 1988

机译：用于矩阵乘法的高效位级，字级和块级收缩阵列

A Comparison of FPGA Implementations of Bit-Level and Word-Level Matrix Multipliers

摘要

著录项

相似文献

相关主题

期刊订阅