首页> 外文会议>International conference on parallel and distributed processing techniques and applications >Performance Evaluation of Golub-Kahan-Lanczos Algorithm with Reorthogonalization by Classical Gram-Schmidt Algorithm and OpenMP
【24h】

Performance Evaluation of Golub-Kahan-Lanczos Algorithm with Reorthogonalization by Classical Gram-Schmidt Algorithm and OpenMP

机译:经典Gram-Schmidt算法和OpenMP进行正交化的Golub-Kahan-Lanczos算法的性能评估

获取原文

摘要

The Golub-Kahan-Lanczos algorithm with reorthogonalization (GKLR algorithm) is an algorithm for computing a subset of singular triplets for large-scale sparse matrices. The reorthogonalization tends to become a bottleneck of elapsed time, as the iteration number of the GKLR algorithm increases. In this paper, OpenMP-based parallel implementation of the classical Gram-Schmidt algorithm with reorthogonalization (0MP-CGS2 algorithm) is introduced. The 0MP-CGS2 algorithm has the advantage of data reusability and is expected to achieve higher performance of the reorthogonalization computations on shared-memory multi-core processors with large caches than the conventional reorthogonalization algorithms. Numerical experiments on shared-memory multi-core processors show that the OMP-CGS2 algorithm accelerates the GKLR algorithm more effectively for computing a subset of singular triplets for a sparse matrix than the conventional reorthogonalization algorithms.
机译:具有正交化的Golub-Kahan-Lanczos算法(GKLR算法)是一种用于为大型稀疏矩阵计算奇异三元组的子集的算法。随着GKLR算法的迭代次数增加,重新正交化趋向于成为经过时间的瓶颈。本文介绍了具有正交化功能的经典Gram-Schmidt算法(0MP-CGS2算法)的基于OpenMP的并行实现。 0MP-CGS2算法具有数据可重用性的优点,并且有望在具有大缓存的共享内存多核处理器上实现比常规的正交化算法更高的正交化计算性能。在共享内存多核处理器上的数值实验表明,与传统的正交算法相比,OMP-CGS2算法可以更有效地加速GKLR算法,以计算稀疏矩阵的奇异三元组的子集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号