首页> 外文期刊>Journal of Parallel and Distributed Computing >Mixing LU and QR factorization algorithms to design high-performance dense linear algebra solvers
【24h】

Mixing LU and QR factorization algorithms to design high-performance dense linear algebra solvers

机译:混合LU和QR分解算法以设计高性能的密集线性代数求解器

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper introduces hybrid LU-QR algorithms for solving dense linear systems of the form Ax = b. Throughout a matrix factorization, these algorithms dynamically alternate LU with local pivoting and QR elimination steps based upon some robustness criterion. LU elimination steps can be very efficiently parallelized, and are twice as cheap in terms of floating-point operations, as QR steps. However, LU steps are not necessarily stable, while QR steps are always stable. The hybrid algorithms execute a QR step when a robustness criterion detects some risk for instability, and they execute an LU step otherwise. The choice between LU and QR steps must have a small computational overhead and must provide a satisfactory level of stability with as few QR steps as possible. In this paper, we introduce several robustness criteria and we establish upper bounds on the growth factor of the norm of the updated matrix incurred by each of these criteria. In addition, we describe the implementation of the hybrid algorithms through an extension of the PaRSEC software to allow for dynamic choices during execution. Finally, we analyze both stability and performance results compared to state-of-the-art linear solvers on parallel distributed multicore platforms. A comprehensive set of experiments shows that hybrid LU-QR algorithms provide a continuous range of trade-offs between stability and performances.
机译:本文介绍了混合LU-QR算法,用于求解Ax = b形式的稠密线性系统。在整个矩阵分解过程中,这些算法根据一些鲁棒性标准动态地将LU与局部透视和QR消除步骤交替使用。 LU消除步骤可以非常有效地并行化,并且在浮点运算方面的价格是QR步骤的两倍。但是,LU步长不一定稳定,而QR步长总是稳定的。当鲁棒性标准检测到不稳定的风险时,混合算法执行QR步骤,否则执行LU步骤。 LU和QR步骤之间的选择必须具有较小的计算开销,并且必须以尽可能少的QR步骤提供令人满意的稳定性。在本文中,我们介绍了几个鲁棒性准则,并确定了每个准则所导致的更新矩阵范数的增长因子的上限。此外,我们通过扩展PaRSEC软件来描述混合算法的实现,以允许在执行过程中进行动态选择。最后,与并行分布式多核平台上的最新线性求解器相比,我们同时分析了稳定性和性能结果。一组全面的实验表明,混合LU-QR算法在稳定性和性能之间提供了一系列的权衡取舍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号