首页> 外文期刊>Future generation computer systems >Parallel and fully recursive multifrontal sparse Cholesky
【24h】

Parallel and fully recursive multifrontal sparse Cholesky

机译:并行且完全递归的多额稀疏Cholesky

获取原文
获取原文并翻译 | 示例

摘要

We describe the design, implementation, and performance of a new parallel sparse Cholesky factorization code. The code uses a multifrontal factorization strategy. Operations on small dense submatrices are performed using new dense matrix subroutines that are part of the code, although the code can also use the BLAS and LAPACK. The new code is recursive at both the sparse and the dense levels, it uses a novel recursive data layout for dense submatrices, and it is parallelized using Cilk, an extension of C specifically designed to parallelize recursive codes. We demonstrate that the new code performs well and scales well on SMPs. In particular, on up to 16 processors, the code outperforms two state-of-the-art message-passing codes. The scalability and high performance that the code achieves imply that recursive schedules, blocked data layouts, and dynamic scheduling are effective in the implementation of sparse factorization codes.
机译:我们描述了新的并行稀疏Cholesky因式分解代码的设计,实现和性能。该代码使用了多前沿分解策略。尽管代码还可以使用BLAS和LAPACK,但是使用新的稠密矩阵子例程执行小稠密子矩阵的操作,这些子例程是代码的一部分。新代码在稀疏和密集级别上都是递归的,它对密集子矩阵使用新颖的递归数据布局,并使用Cilk(专门设计用于并行化递归代码的C扩展)进行并行化。我们证明了新代码在SMP上表现良好且可扩展。特别是,在最多16个处理器上,该代码的性能优于两个最新的消息传递代码。该代码实现的可伸缩性和高性能意味着递归调度,阻塞的数据布局和动态调度在稀疏分解代码的实现中有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号