首页> 外文学位 >Algorithms and hardware designs for decimal multiplication.
【24h】

Algorithms and hardware designs for decimal multiplication.

机译:十进制乘法的算法和硬件设计。

获取原文
获取原文并翻译 | 示例

摘要

Although a preponderance of business data is in decimal form, virtually all floating-point arithmetic units on today's general-purpose microprocessors are based on the binary number system. Higher performance, less circuitry, and better overall error characteristics are the main reasons why binary floating-point hardware (BFP) is chosen over decimal floating-point (DFP) hardware. However, the binary number system cannot precisely represent many common decimal values. Further, although BFP arithmetic is well-suited for the scientific community, it is quite different from manual calculation norms and does not meet many legal requirements.;Due to the shortcomings of BFP arithmetic, many applications involving fractional decimal data are forced to perform their arithmetic either entirely in software or with a combination of software and decimal fixed-point hardware. Providing DFP hardware has the potential to dramatically improve the performance of such applications. Only recently has a large microprocessor manufacturer begun providing systems with DFP hardware. With available die area continually increasing, dedicated DFP hardware implementations are likely to be offered by other microprocessor manufacturers.;This dissertation discusses the motivation for decimal computer arithmetic, a brief history of this arithmetic, and relevant software and processor support for a variety of decimal arithmetic functions. As the context of the research is the IEEE Standard for Floating-point Arithmetic (IEEE 754-2008) and two-state transistor technology, descriptions of the standard and various decimal digit encodings are described.;The research presented investigates algorithms and hardware support for decimal multiplication, with particular emphasis on DFP multiplication. Both iterative and parallel implementations are presented and discussed. Novel ideas are advanced such as the use of decimal counters and compressors and the support of IEEE 754-2008 floating-point, including early estimation of the shift amount, in-line exception handling, on-the-fly sticky bit generation, and efficient decimal rounding. The iterative and parallel, decimal multiplier designs are compared and contrasted in terms of their latency, throughput, area, delay, and usage.;The culmination of this research is the design and comparison of an iterative DFP multiplier with a parallel DFP multiplier. The iterative DFP multiplier is significantly smaller and may achieve a higher practical frequency of operation than the parallel DFP multiplier. Thus, in situations where the area available for DFP is an important design constraint, the iterative DFP multiplier may be an attractive implementation. However, the parallel DFP multiplier has less latency for a single multiply operation and is able to produce a new result every cycle. As for power considerations, the fewer overall devices in the iterative multiplier, and more importantly the fewer storage elements, should result in less leakage. This benefit is mitigated by its higher latency and lower throughput.;The proposed implementations are suitable for general-purpose, server, and main-frame microprocessor designs. Depending on the demand for DFP in human-centric applications, this research may be employed in the application-specific integrated circuits (ASICs) market.
机译:尽管大多数业务数据都是十进制形式,但实际上当今通用微处理器上的所有浮点算术单元都是基于二进制数字系统的。更高的性能,更少的电路以及更好的总体错误特性是为什么选择二进制浮点硬件(BFP)而不是十进制浮点(DFP)硬件的主要原因。但是,二进制数系统不能精确表示许多常见的十进制值。此外,尽管BFP算法非​​常适合科学界,但它与手动计算规范有很大不同,并且不满足许多法律要求。;由于BFP算法的缺点,许多涉及分数小数数据的应用程序被迫执行其算术运算可以完全在软件中进行,也可以在软件与十进制定点硬件的组合中进行。提供DFP硬件有可能极大地改善此类应用程序的性能。直到最近,一家大型微处理器制造商才开始为系统提供DFP硬件。随着可用管芯面积的不断增加,其他微处理器制造商可能会提供专用的DFP硬件实现。本论文讨论了十进制计算机算法的动机,该算法的简要历史以及对各种十进制的相关软件和处理器支持。算术函数。由于研究的背景是IEEE浮点算术标准(IEEE 754-2008)和二态晶体管技术,因此描述了该标准的描述以及各种十进制数字编码。十进制乘法,尤其是DFP乘法。本文介绍并讨论了迭代实现和并行实现。提出了一些新的想法,例如使用十进制计数器和压缩器以及对IEEE 754-2008浮点的支持,包括对偏移量的早期估计,在线异常处理,即时粘滞位生成以及高效的支持。十进制舍入。比较迭代和并行十进制乘法器设计的时延,吞吐量,面积,延迟和使用情况。;本研究的最高点是迭代DFP乘法器与并行DFP乘法器的设计和比较。与并行DFP乘法器相比,迭代DFP乘法器要小得多,并且可以实现更高的实际操作频率。因此,在DFP可用面积是重要设计约束的情况下,迭代DFP乘数可能是一种有吸引力的实现。但是,并行DFP乘法器对单个乘法操作的等待时间较短,并且能够在每个周期产生新结果。对于功率方面的考虑,迭代乘法器中的整体设备越少,更重要的是,存储元件越少,应导致更少的泄漏。它的较高的延迟和较低的吞吐量降低了此好处。建议的实现适用于通用,服务器和大型机微处理器设计。根据以人为中心的应用程序对DFP的需求,该研究可能会在专用集成电路(ASIC)市场中进行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号