首页> 外文会议>IEEE Symposium on Computer Arithmetic >A New Multiplication Algorithm for Extended Precision Using Floating-Point Expansions
【24h】

A New Multiplication Algorithm for Extended Precision Using Floating-Point Expansions

机译:使用浮点展开扩展精度的新乘法算法

获取原文

摘要

Some important computational problems must use a floating-point (FP) precision several times higher than the hardware-implemented available one. These computations critically rely on software libraries for high-precision FP arithmetic. The representation of a high-precision data type crucially influences the corresponding arithmetic algorithms. Recent work showed that algorithms for FP expansions, that is, a representation based on unevaluated sum of standard FP types, benefit from various high-performance support for native FP, such as low latency, high throughput, vectorization, threading, etc. Bailey's QD library and its corresponding Graphics Processing Unit (GPU) version, GQD, are such examples. Despite using native FP arithmetic as the key operations, QD and GQD algorithms are focused on double-double or quad-double representations and do not generalize efficiently or naturally to a flexible number of components in the FP expansion. In this paper, we introduce a new multiplication algorithm for FP expansion with flexible precision, up to the order of tens of FP elements in mind. The main feature consists in the partial products being accumulated in a special designed data structure that has the regularity of a fixed-point representation while allowing the computation to be naturally carried out using native FP types. This allows us to easily avoid unnecessary computation and to present rigorous accuracy analysis transparently. The algorithm, its correctness and accuracy proofs and some performance comparisons with existing libraries are all contributions of this paper.
机译:一些重要的计算问题必须使用浮点(FP)精度,该精度是硬件实现的浮点精度的几倍。这些计算严重依赖于软件库来进行高精度FP算术运算。高精度数据类型的表示方式会严重影响相应的算术算法。最近的工作表明,用于FP扩展的算法(即基于未评估的标准FP类型总和的表示)受益于对本机FP的各种高性能支持,例如低延迟,高吞吐量,矢量化,线程化等。Bailey的QD示例库和其相应的图形处理单元(GPU)版本GQD。尽管使用本机FP算术作为关键操作,但QD和GQD算法专注于双精度双精度或四重双精度表示,并且无法有效或自然地推广到FP扩展中的灵活数量的组件。在本文中,我们介绍了一种用于FP扩展的新乘法算法,具有灵活的精度,最多可考虑数十个FP元素。主要特征在于将部分产品累积在经过特殊设计的数据结构中,该结构具有定点表示的规则性,同时允许使用本机FP类型自然进行计算。这使我们能够轻松避免不必要的计算,并透明地呈现严格的精度分析。该算法,其正确性和准确性证明以及与现有库的一些性能比较都是本文的贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号