A floating point conversion algorithm for mixed precision computations

Choon?Lih?Hoo; Sallehuddin?Mohamed?Haris; Nik?Abdullah?Nik?Mohamed

首页> 外文期刊>Journal of Zhejiang university science >A floating point conversion algorithm for mixed precision computations

【24h】

A floating point conversion algorithm for mixed precision computations

机译：用于混合精度计算的浮点转换算法

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The floating point number is the most commonly used real number representation for digital computations due to its high precision characteristics. It is used on computers and on single chip applications such as DSP chips. double precision (64-bit) representations allow for a wider range of real numbers to be denoted. However, single precision (32-bit) operations are more efficient. Recently, there has been an increasing interest in mixed precision computations which take advantage of single precision efficiency on 64-bit numbers. This calls for the ability to interchange between the two formats. In this paper, an algorithm that converts floating point numbers from 64- to 32-bit representations is presented. The algorithm was implemented as a verilog code and tested on field programmable gate array (FPGA) using the Quartus II DE2 board and Agilent 16821A portable logic analyzer. Results indicate that the algorithm can perform the conversion reliably and accurately within a constant execution time of 25 ns with a 20 MHz clock frequency regardless of the number being converted.

机译：浮点数由于其高精度特性而成为数字计算中最常用的实数表示。它用于计算机和单芯片应用程序，例如DSP芯片。双精度（64位）表示允许表示更大范围的实数。但是，单精度（32位）操作效率更高。最近，人们对混合精度计算越来越感兴趣，这种计算利用了64位数字的单精度效率。这要求能够在两种格式之间互换。本文提出了一种将浮点数从64位表示转换为32位表示的算法。该算法被实现为Verilog代码，并使用Quartus II DE2板和Agilent 16821A便携式逻辑分析仪在现场可编程门阵列（FPGA）上进行了测试。结果表明，该算法可以在25 ns的恒定执行时间内以20 MHz的时钟频率可靠，准确地执行转换，而与转换的数量无关。

著录项

来源
《Journal of Zhejiang university science》 |2012年第9期|共8页
作者
Choon?Lih?Hoo; Sallehuddin?Mohamed?Haris; Nik?Abdullah?Nik?Mohamed;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A floating point conversion algorithm for mixed precision computations [J] . Choon Lih HOO, Sallehuddin Mohamed HARIS, Nik Abdullah Nik MOHAMED 浙江大学学报（英文版）（C辑：计算机与电子） . 2012,第009期

机译：用于混合精度计算的浮点转换算法
2. Accelerating scientific computations with mixed precision algorithms [J] . Baboulin M, Buttari A, Dongarra J, Computer physics communications . 2009,第12期

机译：使用混合精度算法加速科学计算
3. Arithmetic Algorithms for Extended Precision Using Floating-Point Expansions [J] . M. Joldeş, O. Marty, J. M. Muller, IEEE Transactions on Computers . 2016,第4期

机译：使用浮点扩展来扩展精度的算术算法
4. Automatically Adapting Programs for Mixed-Precision Floating-Point Computation [C] . Michael O. Lam, Jeffrey K. Hollingsworth, Bronis R. de Supinski, ACM international conference on supercomputing . 2013

机译：混合精度浮点计算的自动调整程序
5. Efficient floating-point error testing and rigorous mixed precision tuning. [D] . Chiang, Wei-Fan. 2016

机译：高效的浮点错误测试和严格的混合精度调整。
6. Mixed-Precision Deep Learning Based on Computational Memory [O] . S. R. Nandakumar, Manuel Le Gallo, Christophe Piveteau, 2020

机译：基于计算记忆的混合精密深度学习
7. Accelerating scientific computations with mixed precision algorithms [O] . Baboulin Marc, Buttari Alfredo, Dongarra Jack, 2009

机译：使用混合精度算法加速科学计算

A floating point conversion algorithm for mixed precision computations

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅