首页> 外文期刊>Journal of Parallel and Distributed Computing >Revisiting parallel cyclic reduction and parallel prefix-based algorithms for block tridiagonal systems of equations
【24h】

Revisiting parallel cyclic reduction and parallel prefix-based algorithms for block tridiagonal systems of equations

机译:重新讨论块三对角方程组的并行循环归约和基于并行前缀的算法

获取原文
获取原文并翻译 | 示例
           

摘要

Direct solvers based on prefix computation and cyclic reduction algorithms exploit the special structure of tridiagonal systems of equations to deliver better parallel performance compared to those designed for more general systems of equations. This performance advantage is even more pronounced for block tridiagonal systems. In this paper, we re-examine the performances of these two algorithms taking the effects of block size into account. Depending on the block size, the parameter space spanned by the number of block rows, size of the blocks and the processor count is shown to favor one or the other of the two algorithms. A critical block size that separates these two regions is shown to emerge and its dependence both on problem dependent parameters and on machine-specific constants is established. Empirical verification of these analytical findings is carried out on up to 2048 cores of a Cray XT4 system.
机译:基于前缀计算和循环约简算法的直接求解器利用三对角方程组的特殊结构,与为更一般的方程组设计的并行性能相比,具有更好的并行性能。对于块状对角线系统,这种性能优势更为明显。在本文中,我们考虑到块大小的影响,重新检查了这两种算法的性能。根据块的大小,显示的参数空间由块行数,块的大小和处理器数量组成,显示为有利于两种算法中的一种。显示出将这两个区域分隔开的临界块大小,并且建立了其对问题相关参数和机器特定常数的依赖关系。这些分析结果的经验验证是在Cray XT4系统的多达2048个内核上进行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号