首页> 外文期刊>IEEE Transactions on Computers >Delay-optimized implementation of IEEE floating-point addition
【24h】

Delay-optimized implementation of IEEE floating-point addition

机译:延迟优化的IEEE浮点加法实现

获取原文
获取原文并翻译 | 示例

摘要

We present an IEEE floating-point adder (FP-adder) design. The adder accepts normalized numbers, supports all four IEEE rounding modes, and outputs the correctly normalized rounded sum/difference in the format required by the IEEE Standard. The FP-adder design achieves a low latency by combining various optimization techniques such as: a nonstandard separation into two paths, a simple rounding algorithm, unification of rounding cases for addition and subtraction, sign-magnitude computation of a difference based on one's complement subtraction, compound adders, and fast circuits for approximate counting of leading zeros from borrow-save representation. We present technology-independent analysis and optimization of our implementation based on the Logical Effort hardware model and we determine optimal gate sizes and optimal buffer insertion. We estimate the delay of our optimized design at 30.6 FO4 delays for double precision operands (15.3 FO4 delays per stage between latches). We overview other IEEE FP addition algorithms from the literature and compare these algorithms with our algorithm. We conclude that our algorithm has shorter latency (-13 percent) and cycle time (-22 percent) compared to the next fastest algorithm.
机译:我们提出一种IEEE浮点加法器(FP-adder)设计。加法器接受归一化的数字,支持所有四个IEEE舍入模式,并以IEEE标准要求的格式输出正确归一化的舍入和/差。 FP加法器设计通过组合各种优化技术来实现低延迟,这些优化技术包括:将非标准路径分为两条路径,简单的舍入算法,统一加法和减法的舍入情况,基于补码减法的差的符号幅度计算,复合加法器和快速电路,用于从借位保存表示形式中对前导零进行近似计数。我们基于Logical Effort硬件模型介绍了与技术无关的分析和实现的优化,并确定了最佳的门大小和最佳的缓冲区插入。对于双精度操作数,我们估计优化设计的延迟为30.6 FO4延迟(锁存器之间每级15.3 FO4延迟)。我们从文献中概述了其他IEEE FP加法算法,并将这些算法与我们的算法进行比较。我们得出的结论是,与下一个最快的算法相比,我们的算法具有更短的延迟(-13%)和周期时间(-22%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号