首页> 外文期刊>Journal of Computational and Applied Mathematics >An efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures
【24h】

An efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures

机译:分布式内存架构上有效的混合三角形划分和征管算法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we propose an efficient divide-and-conquer (DC) algorithm for symmetric tridiagonal matrices based on ScaLAPACK and the hierarchically semiseparable (HSS) matrices. HSS is an important type of rank-structured matrices. The most computationally intensive part of the DC algorithm is computing the eigenvectors via matrix-matrix multiplications (MMM). In our parallel hybrid DC (PHDC) algorithm, MMM is accelerated by using HSS matrix techniques when the intermediate matrix is large. All the HSS computations are performed via the package STRUMPACK. PHDC has been tested by using many different matrices. Compared with the DC implementation in MKL, PHDC can be faster for some matrices with few deflations when using hundreds of processes. However, the gains decrease as the number of processes increases. The comparisons of PHDC with ELPA (the Eigenvalue soLvers for Petascale Applications library) are similar. PHDC is usually slower than MKL and ELPA when using 300 or more processes on the Tianhe-2 supercomputer. (C) 2018 Elsevier B.V. All rights reserved.
机译:在本文中,我们提出了一种基于缩放标签的对称三角形矩阵的有效的分割(DC)算法和分层半可索收(HSS)矩阵。 HSS是一种重要类型的秩结构矩阵。直流算法的最多计算密集的部分是通过矩阵矩阵乘法(MMM)计算特征向量。在我们的平行混合直流(PHDC)算法中,当中间矩阵大时,通过使用HSS矩阵技术加速MMM。所有HSS计算都是通过包Strumpack执行的。通过使用许多不同的矩阵来测试PHDC。与MKL中的直流实现相比,在使用数百个过程时,某些矩阵的PHDC可以更快地为一些矩阵。然而,随着过程的数量增加,收益减少。 PHDC与ELPA的比较(PETASCALE应用库的特征值溶剂)是相似的。 PHDC通常比MKL和ELPA在天河2超级计算机上使用300个或更多工艺。 (c)2018年elestvier b.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号