Multiple-Precision Scaled Vector Addition on Graphics Processing Unit

Abstract

Many large-scale problems require linear algebra operations with precision exceeding that of the standard binary64 floating-point format. In this paper, we implement a multiple-precision scaled vector addition BLAS routine (WAXPBY) on graphics processing units. We use a residue number system (RNS) to represent the significands of floating-point values. In RNS, large numbers are replaced by their residues, and addition, subtraction, and multiplication are performed on these residues in parallel and without carry propagation. Our parallel WAXPBY algorithm is divided into several steps, each carried out by a separate GPU kernel. Experiments show that the developed routine clearly outperforms parallel CPU-based multiple-precision implementations.
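To illustrate the carry-free property of RNS arithmetic mentioned in the abstract, here is a minimal Python sketch. It is not the authors' implementation: the moduli set, the function names, and the restriction to small non-negative integers (rather than floating-point significands processed by GPU kernels) are all assumptions made for illustration. It computes the scaled vector addition w = alpha*x + beta*y with every residue channel handled independently.

```python
from math import prod

# Hypothetical set of pairwise coprime moduli (an assumption, not the
# set used in the paper). Dynamic range: M = 7 * 11 * 13 * 15 = 15015.
MODULI = (7, 11, 13, 15)

def to_rns(x, moduli=MODULI):
    """Replace an integer by its residues modulo each modulus."""
    return tuple(x % m for m in moduli)

def rns_add(a, b, moduli=MODULI):
    """Channel-wise addition; no carries pass between channels."""
    return tuple((ai + bi) % m for ai, bi, m in zip(a, b, moduli))

def rns_mul(a, b, moduli=MODULI):
    """Channel-wise multiplication; each channel is independent."""
    return tuple((ai * bi) % m for ai, bi, m in zip(a, b, moduli))

def from_rns(r, moduli=MODULI):
    """Recover the integer in [0, M) via the Chinese remainder theorem."""
    M = prod(moduli)
    total = 0
    for ri, m in zip(r, moduli):
        Mi = M // m
        total += ri * Mi * pow(Mi, -1, m)  # modular inverse (Python 3.8+)
    return total % M

def waxpby_rns(alpha, x, beta, y):
    """Illustrative w = alpha*x + beta*y on non-negative integers.

    Results are exact only while every w[i] stays below the product
    of the moduli.
    """
    a, b = to_rns(alpha), to_rns(beta)
    return [
        from_rns(rns_add(rns_mul(a, to_rns(xi)), rns_mul(b, to_rns(yi))))
        for xi, yi in zip(x, y)
    ]
```

For example, `waxpby_rns(3, [1, 2, 3], 5, [10, 20, 30])` evaluates to `[53, 106, 159]`. Because each residue channel is computed independently, the per-channel work maps naturally onto parallel threads, which is the property the abstract refers to.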

Bibliographic details

Source: International Conference on Parallel Computing Technologies, 2019, pp. 179-186 (8 pages)
Conference location: Almaty (KZ)
Authors: Konstantin Isupov; Alexander Kuvaev
Affiliation: Vyatka State University, Kirov 610000, Russia
Original format: PDF
Keywords:
High-precision computations; Computer arithmetic; Residue number system; BLAS; CUDA

Similar literature

1. Accelerating the RTTOV-7 IASI and AMSU-A radiative transfer models on graphics processing units: evaluating central processing unit/graphics processing unit-hybrid and pure-graphics processing unit approaches [J]. Jarno Mielikainen, Bormin Huang, Hung-Lung Allen Huang. Journal of Applied Remote Sensing, 2011.
2. Preselective Screening for Linear-Scaling Exact Exchange-Gradient Calculations for Graphics Processing Units and General Strong-Scaling Massively Parallel Calculations [J]. Kussmann Joerg, Ochsenfeld Christian. Journal of Chemical Theory and Computation (JCTC), 2015, No. 3.
3. Stochastic proximity embedding on graphics processing units: Taking multidimensional scaling to a new scale [J]. Yang E., Liu P., Rassokhin D.N. Journal of Chemical Information and Modeling, 2011, No. 11.
4. Multiple-Precision Scaled Vector Addition on Graphics Processing Unit [C]. Konstantin Isupov, Alexander Kuvaev. International Conference on Parallel Computing Technologies, 2019.
5. Accelerating scientific computation in bioinformatics by using graphics processing units as parallel vector processors [D]. Payne, Bryson R. 2005.
6. The feasibility of genome-scale biological network inference using Graphics Processing Units [O]. Raghuram Thiagarajan, Amir Alavi, Jagdeep T. Podichetty. 2017.
7. Multiple-Precision BLAS Library for Graphics Processing Units [O]. Konstantin Isupov, Vladimir Knyazkov. 2020.
