首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium Workshops >Sparse Supernodal Solver Using Block Low-Rank Compression
【24h】

Sparse Supernodal Solver Using Block Low-Rank Compression

机译:使用块低级压缩的稀疏超墨求解器

获取原文

摘要

This paper presents two approaches using a Block Low-Rank (BLR) compression technique to reduce the memory footprint and/or the time-to-solution of the sparse supernodal solver PASTIX. This flat, non-hierarchical, compression method allows to take advantage of the low-rank property of the blocks appearing during the factorization of sparse linear systems, which come from the discretization of partial differential equations. The first approach, called Minimal Memory, illustrates the maximum memory gain that can be obtained with the BLR compression method, while the second approach, called Just-In-Time, mainly focuses on reducing the computational complexity and thus the time-to-solution. Singular Value Decomposition (SVD) and Rank-Revealing QR (RRQR), as compression kernels, are both compared in terms of factorization time, memory consumption, as well as numerical properties. Experiments on a single node with 24 threads and 128 GB of memory are presented on a set of matrices from real-life problems. We demonstrate a memory footprint reduction of up to 4:4 times using the Minimal Memory strategy and a computational time speedup of up to 3:3 times with the Just-In-Time strategy.
机译:本文使用块低级(BLR)压缩技术呈现了两种方法,以减少存储器占地面积和/或稀疏超氖求解器胶片的时间到溶液。这种平坦的非分层压缩方法允许利用在稀疏线性系统的分解期间出现的块的低秩属性,这来自部分微分方程的离散化。称为最小内存的第一种方法说明了可以用BLR压缩方法获得的最大存储器增益,而第二种方法,即在立即调用,主要侧重于降低计算复杂度,从而达到时间 - 解决方案。奇异值分解(SVD)和排名QR(RRQR)作为压缩内核,在分解时间,内存消耗以及数值方面都是比较的。在来自真实问题的一组矩阵上呈现了具有24个线程和128 GB的单个节点的实验。我们展示了使用最小内存策略的内存占据减少最多4:4次,并且计算时间加速高达3:3次,即在立即策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号