首页> 外文期刊>Science of Computer Programming >Comparison of eigensolvers for symmetric band matrices
【24h】

Comparison of eigensolvers for symmetric band matrices

机译:对称谱带矩阵特征求解器的比较

获取原文
获取原文并翻译 | 示例
       

摘要

We compare different algorithms for computing eigenvalues and eigenvectors of a symmetric band matrix across a wide range of synthetic test problems. Of particular interest is a comparison of state-of-the-art tridiagonalization-based methods as implemented in Lapack or Plasma on the one hand, and the block divide-and-conquer (BD&C) algorithm as well as the block twisted factorization (BTF) method on the other hand. The BD&C algorithm does not require tridiagonalization of the original band matrix at all, and the current version of the BTF method tridiagonalizes the original band matrix only for computing the eigenvalues. Avoiding the tridiagonalization process sidesteps the cost of backtransformation of the eigenvectors. Beyond that, we discovered another disadvantage of the backtransformation process for band matrices: In several scenarios, a lot of gradual underflow is observed in the (optional) accumulation of the transformation matrix and in the (obligatory) backtransformation step. According to the IEEE 754 standard for floating-point arithmetic, this implies many operations with subnormal (denormalized) numbers, which causes severe slowdowns compared to the other algorithms without backtransformation of the eigenvectors. We illustrate that in these cases the performance of existing methods from Lapack and Plasma reaches a competitive level only if subnormal numbers are disabled (and thus the IEEE standard is violated). Overall, our performance studies illustrate that if the problem size is large enough relative to the bandwidth, BD&C tends to achieve the highest performance of all methods if the spectrum to be computed is clustered. For test problems with well separated eigenvalues, the BTF method tends to become the fastest algorithm with growing problem size.
机译:我们比较了广泛的综合测试问题中用于计算对称带矩阵特征值和特征向量的不同算法。特别令人感兴趣的是,一方面是在Lapack或Plasma中实现的基于最新的对角线化的方法的比较,以及块分割和征服(BD&C)算法以及块扭曲分解(BTF) )方法。 BD&C算法完全不需要原始带矩阵的三对角线化,而当前版本的BTF方法仅用于计算特征值的三对角线化原始带状矩阵。避免三对角化过程避开了特征向量逆变换的成本。除此之外,我们还发现了带矩阵逆变换过程的另一个缺点:在几种情况下,在变换矩阵的(可选)累积和(强制性)逆变换步骤中观察到大量的逐渐下溢。根据用于浮点算术的IEEE 754标准,这意味着许多具有次正规(非规格化)数字的运算,与没有特征向量的逆变换的其他算法相比,这会导致严重的减速。我们说明,在这些情况下,仅当禁用次标准数(因此违反了IEEE标准)时,Lapack和Plasma的现有方法的性能才能达到竞争水平。总体而言,我们的性能研究表明,如果问题的大小相对于带宽足够大,那么如果要计算的频谱是聚类的,BD&C往往会实现所有方法的最高性能。对于特征值充分分离的测试问题,BTF方法趋于成为问题规模不断扩大的最快算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号