A Communication-Avoiding, Hybrid-Parallel, Rank-Revealing Orthogonalization Method

机译：一种避免通信的，混合并行，秩公开的正交化方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Orthogonalization consumes much of the run time of many iterative methods for solving sparse linear systems and eigenvalue problems. Commonly used algorithms, such as variants of Gram-Schmidt or Householder QR, have performance dominated by communication. Here, "communication" includes both data movement between the CPU and memory, and messages between processors in parallel. Our Tall Skinny QR (TSQR) family of algorithms requires asymptotically fewer messages between processors and data movement between CPU and memory than typical orthogonalization methods, yet achieves the same accuracy as Householder QR factorization. Furthermore, in block orthogonalizations, TSQR is faster and more accurate than existing approaches for orthogonalizing the vectors within each block ("normalization"). TSQR's rank-revealing capability also makes it useful for detecting deflation in block iterative methods, for which existing approaches sacrifice performance, accuracy, or both. We have implemented a version of TSQR that exploits both distributed-memory and shared-memory parallelism, and supports real and complex arithmetic. Our implementation is optimized for the case of orthogonalizing a small number (5 -- 20) of very long vectors. The shared-memory parallel component uses Intel's Threading Building Blocks, though its modular design supports other shared-memory programming models as well, including computation on the GPU. Our implementation achieves speedups of 2 times or more over competing orthogonalizations. It is available now in the development branch of the Trilinos software package, and will be included in the 10.8 release.

机译：正交化会耗费许多迭代方法来解决稀疏线性系统和特征值问题的大量运行时间。常用算法（例如Gram-Schmidt或Householder QR的变体）的性能以通信为主导。在此，“通信”既包括CPU与存储器之间的数据移动，也包括处理器之间的并行消息。我们的高瘦QR（TSQR）系列算法与典型的正交化方法相比，在处理器之间渐进地减少消息数量以及在CPU和内存之间进行数据移动所需的渐近次数更少，但达到了与Householder QR分解相同的准确性。此外，在块正交化中，TSQR比用于使每个块内的向量正交化的现有方法更快和更准确（“归一化”）。 TSQR的排名显示功能还使其可用于检测块迭代方法中的放气，而对于这些方法，现有方法会牺牲性能，准确性或两者兼而有之。我们已经实现了TSQR版本，该版本可同时利用分布式内存和共享内存并行性，并支持实数和复数算术。我们的实现针对正交化少量（5-20）非常长的向量的情况进行了优化。共享内存并行组件使用英特尔的线程构建模块，尽管其模块化设计也支持其他共享内存编程模型，包括GPU上的计算。与竞争性正交化相比，我们的实现可将速度提高2倍以上。它现在可在Trilinos软件包的开发分支中获得，并将包含在10.8版本中。

著录项

来源
《2011 25th IEEE International Parallel Distributed Processing Symposium》|2011年|p.966-977|共12页
会议地点
作者
Hoemmen Mark;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.133;
关键词

相似文献

外文文献
中文文献
专利

1. Orthogonalized Infinite Edge Element Method—Convergence Improvement by Orthogonalization of Hilbert Matrix in Infinite Edge Element Method [J] . Tamitani S., Tsuzaki K., Wakao S., Magnetics, IEEE Transactions on . 2012,第2期

机译：正交化无限边单元法—通过无限边单元法对希尔伯特矩阵进行正交化来提高收敛性
2. Communication-Avoiding Optimization Methods for Distributed Massive-Scale Sparse Inverse Covariance Estimation [J] . Penporn Koanantakool, Alnur Ali, Ariful Azad, JMLR: Workshop and Conference Proceedings . 2018,第3期

机译：分布式大规模稀疏逆协方差估计的避免通信优化方法
3. Communication-Avoiding Optimization Methods for Distributed Massive-Scale Sparse Inverse Covariance Estimation [J] . Penporn Koanantakool, Alnur Ali, Ariful Azad, JMLR: Workshop and Conference Proceedings . 2018,第3期

机译：分布式大规模稀疏逆协方差估计的避免通信优化方法
4. A communication-avoiding, hybrid-parallel, rank-revealing orthogonalization method [C] . Mark Hoemmen IEEE International Parallel and Distributed Processing Symposium . 2011

机译：避免沟通，杂交平行，秩露天度正交化方法
5. Communication-avoiding Krylov subspace methods. [D] . Hoemmen, Mark. 2010

机译：避免通信的Krylov子空间方法。
6. Semiempirical Quantum-Chemical Orthogonalization-CorrectedMethods: Theory Implementation and Parameters [O] . PavloO. Dral, Xin Wu, Lasse Spörkel, -1

机译：半经验量子化学正交校正方法：理论实现和参数
7. Application of a Preconditioned Chebyshev Basis Communication-Avoiding Conjugate Gradient Method to a Multiphase Thermal-Hydraulic CFD Code [O] . Yasuhiro Idomura, Takuya Ina, Akie Mayumi, 2018

机译：预处理Chebyshev的应用避免缀合物梯度法在多相热液压CFD码中

A Communication-Avoiding, Hybrid-Parallel, Rank-Revealing Orthogonalization Method

摘要

著录项

相似文献

相关主题

期刊订阅