...
首页> 外文期刊>Journal of Parallel and Distributed Computing >Parallel block tridiagonalization of real symmetric matrices
【24h】

Parallel block tridiagonalization of real symmetric matrices

机译:实对称矩阵的并行块对角线化

获取原文
获取原文并翻译 | 示例
           

摘要

Two parallel block tridiagonalization algorithms and implementations for dense real symmetric matrices are presented. Block tridiagonalization is a critical pre-processing step for the block tridiagonal divide-and-conquer algorithm for computing eigensystems and is useful for many algorithms desiring the efficiencies of block structure in matrices. For an "effectively" sparse matrix, which frequently results from applications with strong locality properties, a heuristic parallel algorithm is used to transform it into a block tridiagonal matrix such that the eigenvalue errors remain bounded by some prescribed accuracy tolerance. For a dense matrix without any usable structure, orthogonal transformations are used to reduce it to block tridiagonal form using mostly level 3 BLAS operations. Numerical experiments show that block tridiagonal structure obtained from this algorithm directly affects the computational complexity of the parallel block tridiagonal divide-and-conquer eigensolver. Reduction to block tridiagonal form provides significantly lower execution times, as well as memory traffic and communication cost, over the traditional reduction to tridiagonal form for eigensystem computations.
机译:提出了两种并行块对角线化算法和稠密实对称矩阵的实现。块三对角化是用于计算本征系统的块三对角分治算法的关键预处理步骤,可用于需要矩阵中块结构效率的许多算法。对于“有效”的稀疏矩阵,该稀疏矩阵通常是由具有强局部性的应用程序导致的,使用启发式并行算法将其转换为块三对角矩阵,以使特征值误差仍然受到某些规定的精度公差的限制。对于没有任何可用结构的密集矩阵,可以使用正交变换来减少它以使用3级BLAS运算来阻止三对角形式。数值实验表明,从该算法获得的块三对角结构直接影响并行块三对角分治本征求解器的计算复杂度。简化为块对角线形式比本征系统计算的传统对角线对角线形式减少了,执行时间以及内存通信量和通信成本大大降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号