...
首页> 外文期刊>Journal of supercomputing >On the improvement of a scalable sparse direct solver for unsymmetrical linear equations
【24h】

On the improvement of a scalable sparse direct solver for unsymmetrical linear equations

机译:关于非对称线性方程组的可伸缩稀疏直接求解器的改进

获取原文
   

获取外文期刊封面封底 >>

       

摘要

This paper focuses on the application level improvements in a sparse direct solver specifically used for large-scale unsymmetrical linear equations resulting from unstructured mesh discretization of coupled elliptic/hyperbolic PDEs. Existing sparse direct solvers are designed for distributed server systems taking advantage of both distributed memory and processing units. We conducted extensive numerical experiments with three state-of-the-art direct linear solvers that can work on distributed-memory parallel architectures; namely, MUMPS(MUMPS solver website, http://graal.ens-lyon.fr/MUMPS), WSMP (Technical Report TR RC-21886, IBM, Watson Research Center, Yorktown Heights, 2000), and SUPERLU_DIST (ACM Trans Math Softw 29(2): 110-140, 2003). The performance of these solvers was analyzed in detail, using advanced analysis tools such as Tuning and Analysis Utilities (TAU) and Performance Application Programming Interface (PAPI). The performance is evaluated with respect to robustness, speed, scalability, and efficiency in CPU and memory usage. We have determined application level issues that we believe they can improve the performance of a distributed-shared memory hybrid variant of this solver, which is proposed as an alternative solver [SuperLU_MCDT (Many-Core Distributed)] in this paper. The new solver utilizing the MPI/OpenMP hybrid programming is specifically tuned to handle large unsymmetrical systems arising in reservoir simulations so that higher performance and better scalability can be achieved for a large distributed computing system with many nodes of multicore processors. Two main tasks are accomplished during this study: (i) comparisons of public domain solver algorithms; existing state-of-the-art direct sparse linear system solvers are investigated and their performance and weaknesses based on test cases are analyzed, (ii) improvement of direct sparse solver algorithm (SuperLU_MCDT) for many-core distributed systems is achieved. We provided results of numerical tests that were run on up to 16,384 cores, and used many sets of test matrices for reservoir simulations with unstructured meshes. The numerical results showed that SuperLU_MCDT can outperform SuperLU_DIST 3.3 in terms of both speed and robustness.
机译:本文重点研究稀疏直接求解器中的应用程序级别的改进,该求解器专门用于耦合椭圆/双曲线PDE的非结构化网格离散化所导致的大规模不对称线性方程。现有的稀疏直接求解器是为利用分布式内存和处理单元的分布式服务器系统而设计的。我们使用三个可以在分布式内存并行体系结构上使用的最先进的直接线性求解器进行了广泛的数值实验;即MUMPS(MUMPS求解器网站,http://graal.ens-lyon.fr/MUMPS),WSMP(技术报告TR RC-21886,IBM,沃森研究中心,约克敦高地,2000年)和SUPERLU_DIST(ACM Trans Math Softw 29(2):110-140,2003)。使用诸如调优和分析实用程序(TAU)和性能应用程序编程接口(PAPI)之类的高级分析工具,对这些求解器的性能进行了详细分析。在鲁棒性,速度,可伸缩性以及CPU和内存使用效率方面评估了性能。我们已经确定了应用程序级别的问题,我们认为它们可以提高此求解器的分布式共享内存混合变体的性能,在本文中将其作为替代解决方案[SuperLU_MCDT(Many-Core Distributed)]提出。利用MPI / OpenMP混合编程的新求解器经过专门调整,可处理储层模拟中出现的大型非对称系统,因此,对于具有多个多核处理器节点的大型分布式计算系统,可以实现更高的性能和更好的可伸缩性。在这项研究中完成了两个主要任务:(i)比较公共领域求解器算法;研究了现有的最先进的直接稀疏线性系统求解器,并根据测试案例分析了它们的性能和弱点,(ii)对多核分布式系统的直接稀疏求解器算法(SuperLU_MCDT)进行了改进。我们提供了可在多达16,384个岩心上进行的数值测试的结果,并使用了许多测试矩阵集用于非结构化网格的油藏模拟。数值结果表明,SuperLU_MCDT在速度和鲁棒性方面均优于SuperLU_DIST 3.3。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号