HIGH PRECISION INTEGER ADDITION, SUBTRACTION AND MULTIPLICATION WITH A GRAPHICS PROCESSING UNIT

NIALL EMMART and CHARLES WEEMS


Abstract

In this paper we evaluate the potential for using an NVIDIA graphics processing unit (GPU) to accelerate high precision integer multiplication, addition, and subtraction. The reported peak vector performance for a typical GPU appears to offer good potential for accelerating such a computation. Because of limitations in the on-chip memory, the high cost of kernel launches, and the nature of the architecture’s support for parallelism, we used a hybrid algorithmic approach to obtain good performance on multiplication. On the GPU itself we adapt the Strassen FFT algorithm to multiply 32KB chunks, while on the CPU we adapt the Karatsuba divide-and-conquer approach to optimize application of the GPU’s partial multiplies, which are viewed as “digits” by our implementation of Karatsuba. Even with this approach, the result is at best a factor of three increase in performance, compared with using the GMP package on a 64-bit CPU at a comparable technology node. Our implementations of addition and subtraction achieve up to a factor of eight improvement. We identify the issues that limit performance and discuss the likely impact of planned advances in GPU architecture.
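
The hybrid scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `chunk_multiply`, `karatsuba`, and `CHUNK_BITS` are hypothetical, and `chunk_multiply` simply uses Python's built-in integer product as a stand-in for the GPU's FFT-based multiply of two chunks. The sketch also assumes the subtractive form of Karatsuba, chosen here so that every base-case operand still fits in one chunk; the abstract does not say which variant the paper uses.

```python
import random

CHUNK_BITS = 32 * 1024 * 8  # one "digit" = a 32 KB chunk, as in the abstract


def chunk_multiply(a: int, b: int) -> int:
    """Hypothetical stand-in for the GPU's FFT-based multiply of two chunks."""
    return a * b


def karatsuba(a: int, b: int, digits: int, chunk_bits: int = CHUNK_BITS) -> int:
    """Multiply non-negative integers that each fit in `digits` chunks."""
    if digits <= 1:
        return chunk_multiply(a, b)
    half = digits // 2       # number of chunks in the low part
    upper = digits - half    # number of chunks in the high part (>= half)
    shift = half * chunk_bits
    mask = (1 << shift) - 1
    a_lo, a_hi = a & mask, a >> shift
    b_lo, b_hi = b & mask, b >> shift
    lo = karatsuba(a_lo, b_lo, half, chunk_bits)
    hi = karatsuba(a_hi, b_hi, upper, chunk_bits)
    # Subtractive middle term: the absolute differences never exceed the
    # width of the wider part, so no carry bit spills past a chunk boundary.
    da, db = a_hi - a_lo, b_hi - b_lo
    sign = -1 if (da < 0) != (db < 0) else 1
    diff = sign * karatsuba(abs(da), abs(db), upper, chunk_bits)
    cross = hi + lo - diff   # equals a_hi*b_lo + a_lo*b_hi
    return (hi << (2 * shift)) + (cross << shift) + lo


if __name__ == "__main__":
    # Small chunks keep the demonstration fast; the logic is unchanged.
    x = random.getrandbits(4 * 64)
    y = random.getrandbits(4 * 64)
    assert karatsuba(x, y, digits=4, chunk_bits=64) == x * y
```

With four digits per operand, this recursion performs 9 chunk multiplies where the schoolbook method would perform 16, which is the saving that treating the GPU's partial products as Karatsuba digits is meant to capture.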


Bibliographic details

Source: Parallel Processing Letters, 2010, Issue 4, pp. 293-306 (14 pages)
Authors: NIALL EMMART and CHARLES WEEMS
Affiliation: Computer Science Department, University of Massachusetts, Amherst, MA 01003-4610, USA
Original format: PDF
Language: English
Keywords: parallel algorithm, multiple precision arithmetic, graphics processing unit

