首页> 外文期刊>Circuits, systems, and signal processing >An Unified Architecture for Single, Double, Double-Extended, and Quadruple Precision Division
【24h】

An Unified Architecture for Single, Double, Double-Extended, and Quadruple Precision Division

机译:适用于单精度,双精度,双精度扩展和四重精度部门的统一体系结构

获取原文
获取原文并翻译 | 示例

摘要

A hardware architecture for quadruple precision floating point division arithmetic with multi-precision support is presented. Division is an important yet far more complex arithmetic operation than addition and multiplication, which demands significant amount of hardware resources for a complete implementation. The proposed architecture also supports the processing of single-, double-, and double-extended precision computations with varied latency. An iterative multiplicative-based architecture for multi-precision quadruple precision division is proposed with small size and promising performance. The proposed mantissa division architecture, the most complex sub-unit, employs a series expansion methodology of division. The architecture follows the standard state-of-the-art flow for floating point division arithmetic with normal as well as subnormal processing. The proposed division architecture is synthesized using UMC 90nm ASIC standard cell library. It is also demonstrated using a Xilinx FPGA-based implementation which is integrated with a wide integer multiplier for mantissa division further optimized for FPGA implementations facilitating the built-in DSP blocks efficiently. When compared to existing quadruple precision divider available in the literature, the proposed architecture has 25% equivalent area saving, 2x improvement in latency with improved speed on FPGA platform; and it has more than 50% area saving, 3x improvement in latency-throughput with better speed on ASIC platform.
机译:提出了一种具有多精度支持的四精度浮点除法算法的硬件架构。除法是一种重要的运算,但比加法和乘法复杂得多,因为运算需要大量的硬件资源才能完整实现。所提出的体系结构还支持处理具有变化的等待时间的单,双和双扩展精度计算。提出了一种基于迭代乘法的多精度四重精度除法体系结构,该体系结构尺寸小且性能良好。提议的尾数划分架构是最复杂的子单元,它采用了一系列划分的扩展方法。该体系结构遵循标准的最新技术流程,可进行浮点除法运算以及常规和次常规处理。所提出的划分架构是使用UMC 90nm ASIC标准单元库合成的。还使用基于Xilinx FPGA的实现进行了演示,该实现与用于尾数除法的宽整数乘法器集成在一起,进一步针对FPGA实现进行了优化,从而可以有效地实现内置DSP模块。与文献中现有的四重精度分频器相比,该架构节省了25%的等效面积,在FPGA平台上的延迟提高了2倍,速度提高了;节省的空间超过50%,延迟吞吐量提高了3倍,在ASIC平台上的速度更快。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号