Concurrency and Computation

Iterative sparse matrix-vector multiplication for accelerating the block Wiedemann algorithm over GF(2) on multi-graphics processing unit systems



Abstract

The block Wiedemann (BW) algorithm is frequently used to solve large sparse linear systems over GF(2). Its most time-consuming operation is iterative sparse matrix-vector multiplication. The need to accelerate this step is motivated by the application of BW to the very large matrices that arise in the linear algebra step of the number field sieve (NFS) for integer factorization. In this paper, we derive an efficient CUDA implementation of this operation using a newly designed hybrid sparse matrix format. For a number of tested NFS matrices, this yields speedups of 4 to 8 on a single graphics processing unit (GPU) compared with an optimized multicore implementation. We further present a GPU cluster implementation of the full BW algorithm for NFS matrices. A small GPU cluster is able to outperform larger CPU clusters on large matrices such as the one obtained from the Kilobit special NFS factorization.
