首页> 外文会议>International Conference on Parallel Computing Technologies >Multiple-Precision Scaled Vector Addition on Graphics Processing Unit
【24h】

Multiple-Precision Scaled Vector Addition on Graphics Processing Unit

机译:图形处理单元上的多精度缩放矢量加法

获取原文

摘要

Many large problems need linear algebra operations with a precision exceeding the standard floating-point binary64 format. In this paper, we implement a multiple-precision scaled vector addition BLAS routine (WAXPBY) on graphics processing units. We use a residue number system (RNS) to represent significands of floating-point values. In RNS, large numbers replace with their residues and the operations of addition, subtraction and multiplication perform on these residues in parallel and without carry propagation. Our parallel WAXPBY algorithm is divided into a number of steps, and each step is carried out by a separate GPU kernel. Experiments show that the developed routine clearly outperforms parallel CPU-based multiple-precision implementations.
机译:许多大问题都需要线性代数运算,其精度必须超过标准浮点binary64格式。在本文中,我们在图形处理单元上实现了多精度缩放矢量加法BLAS例程(WAXPBY)。我们使用残数系统(RNS)表示有效的浮点值。在RNS中,大量的残基代替了它们,并且对这些残基的加法,减法和乘法运算是并行进行的,并且没有进位传播。我们的并行WAXPBY算法分为多个步骤,每个步骤都由单独的GPU内核执行。实验表明,所开发的例程明显优于基于并行CPU的多精度实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号