首页> 外文学位 >ON THE DESIGN OF HIGH PERFORMANCE DIGITAL ARITHMETIC UNITS.
【24h】

ON THE DESIGN OF HIGH PERFORMANCE DIGITAL ARITHMETIC UNITS.

机译:高性能数字算术单元的设计。

获取原文
获取原文并翻译 | 示例

摘要

Several new algorithms for use for enhancing the performance of pipelined digital computers have been developed and evaluated. The design of a particular such computer embodying most of these algorithms is discussed in detail--the S-1 Mark IIA. The relationship and importance of the new algorithms to the overall performance of such a machine is analyzed.;A floating-point addition algorithm which has a much shorter latency than previous approaches is developed and analyzed. This algorithm lends itself to the efficient simultaneous calculation of floating-point sums and differences, which is of great value computing FFTs and to other related algorithms. The algorithm resolves floating-point addition into one of two independent cases, each of which can be implemented in fewer logic gate delays than previous algorithms.;Previous techniques of sorting on pipelined machines are analyzed and a new algorithm based on Quicksort is developed. This new algorithm is significantly faster and simpler than previous pipelined sorting techniques.;The use of skewed data representations to increase the performance of interleaved memories for many algorithms is well known. However a large price is paid in convenience by the use of such techniques. A new approach which allows the use of normal data representations but which has all of the performance advantages of the skewed representations is described. This technique is particularly valuable since the hardware used to implement it can also serve as a queue to minimize the effects of temporary stoppages in the instruction and operand fetching and arithmetic execution hardware.;An algorithm for the very rapid pipelined computation of medium precision approximations (about 30 bits for the Mark IIA) to elementary functions is described. This method uses table lookup and two parallel multiplications to triple the precision available from direct table lookup. Current RAM technology permits the effective use of this algorithm for non-trivial word sizes. The method is applied to reciprocal, square-root, exponential, logarithm, arctangent, sine, cosine, and the error function.;Several other new techniques for performance enhancement are also described and analyzed, and fruitful directions for future work in this area are discussed.
机译:已经开发和评估了几种用于增强流水线数字计算机性能的新算法。专门讨论了包含大多数算法的此类计算机的设计-S-1 Mark IIA。分析了新算法与这种机器整体性能的关系和重要性。开发并分析了一种比以前的方法具有更短等待时间的浮点加法算法。该算法有助于有效地同时计算浮点数之和和差,这对于计算FFT和其他相关算法非常有用。该算法将浮点加法分解为两种独立的情况之一,与以前的算法相比,可以在更少的逻辑门延迟下实现浮点加法。分析了流水线机器上的先前排序技术,并开发了一种基于Quicksort的新算法。与以前的流水线排序技术相比,该新算法明显更快,更简单。对于许多算法,使用偏斜数据表示来提高交错存储器的性能是众所周知的。然而,通过使用这种技术在方便方面付出了高昂的代价。描述了一种允许使用常规数据表示但具有偏斜表示的所有性能优势的新方法。该技术特别有价值,因为用于实现该技术的硬件还可以用作队列,以最大程度地减少指令和操作数提取以及算术执行硬件中的临时停止的影响。;用于中等精度近似值的非常快速流水线计算的算法(描述了基本功能的大约30位(对于Mark IIA)。此方法使用表查找和两个并行乘法将直接表查找提供的精度提高三倍。当前的RAM技术允许有效地使用此算法处理非平凡的字长。该方法适用于倒数,平方根,指数,对数,反正切,正弦,余弦和误差函数。;还描述和分析了其他几种性能增强的新技术,并为该领域的未来工作指明了丰硕的方向。讨论过。

著录项

  • 作者

    FARMWALD, PAUL MICHAEL.;

  • 作者单位

    Stanford University.;

  • 授予单位 Stanford University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1981
  • 页码 101 p.
  • 总页数 101
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号