首页> 外文期刊>Procedia Computer Science >Toward a parallel solver for generalized complex symmetric eigenvalue problems
【24h】

Toward a parallel solver for generalized complex symmetric eigenvalue problems

机译:面向广义复杂对称特征值问题的并行求解器

获取原文
           

摘要

Methods for numerically solving generalized complex symmetric (non-Hermitian) eigenvalue problems (EVPs)Ax=λBxserially and in parallel are investigated. This research is motivated by two observations: Firstly, the conventional approach for solving such problems serially, as implemented, e.g., in zggev (LAPACK), is to treat complex symmetric problems as general complex and therefore does not exploit the structural properties. Secondly, there is currently no parallel solver for dense (generalized or standard) non-Hermitian EVPs in ScaLAPACK. The approach presented in this paper especially aims at exploiting the structural properties present in complex symmetric EVPs and at investigating the potential trade-offs between performance improvements and loss of numerical accuracy due to instabilities. For the serial case, a complete reduction based solver for computing eigenvalues of the generalized complex symmetric EVP has been designed, implemented, and is evaluated in terms of numerical accuracy as well as in terms of runtime performance. It is shown that the serial solver achieves a speedup of up to 43 compared to zggev (LAPACK), although at the cost of a reduced accuracy. Furthermore, the major parts of this reduction based solver have been parallelized based on ScaLAPACK and MPI. Their scaling behavior is evaluated on a cluster utilizing up to 1024 cores. Moreover, the parallel codes developed achieve encouraging parallel speedups comparable to the ones of ScaLAPACK routines for the complex Hermitian EVP.
机译:研究了通过串行和并行方式数值求解广义复杂对称(非Hermitian)特征值问题(EVPs)Ax =λBB的方法。这项研究受到两个观察结果的推动:首先,例如以zggev(LAPACK)实施的串行解决此类问题的常规方法是将复杂对称问题视为一般复杂问题,因此不利用结构特性。其次,目前在ScaLAPACK中没有用于密集(广义或标准)非Hermitian EVP的并行求解器。本文提出的方法尤其旨在利用复杂对称EVP中存在的结构特性,并研究性能改进和由于不稳定性而导致的数值精度损失之间的潜在折衷。对于串行情况,已经设计,实现了用于计算广义复对称EVP的特征值的基于完全约简的求解器,并根据数值精度以及运行时性能对其进行了评估。结果表明,与zggev(LAPACK)相比,串行解算器可实现高达43的加速,尽管其代价是精度降低。此外,基于还原的求解器的主要部分已基于ScaLAPACK和MPI进行了并行化。在使用多达1024个内核的群集上评估它们的扩展行为。此外,与复杂的Hermitian EVP的ScaLAPACK例程相比,开发的并行代码实现了令人鼓舞的并行加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号