Journal: Concurrency and Computation: Practice and Experience

Parallel multigrid on hierarchical hybrid grids: a performance study on current high performance computing clusters



Abstract

This article studies the performance and scalability of a geometric multigrid solver implemented within the hierarchical hybrid grids (HHG) software package on current high performance computing clusters up to nearly 300,000 cores. HHG is based on unstructured tetrahedral finite elements that are regularly refined to obtain a block-structured computational grid. One challenge is the parallel mesh generation from an unstructured input grid that roughly approximates a human head within a 3D magnetic resonance imaging data set. This grid is then regularly refined to create the HHG grid hierarchy. As test platforms, a BlueGene/P cluster located at Jülich supercomputing center and an Intel Xeon 5650 cluster located at the local computing center in Erlangen are chosen. To estimate the quality of our implementation and to predict runtime for the multigrid solver, a detailed performance and communication model is developed and used to evaluate the measured single node performance, as well as weak and strong scaling experiments on both clusters. Thus, for a given problem size, one can predict the number of compute nodes that minimize the overall runtime of the multigrid solver. Overall, HHG scales up to the full machines, where the biggest linear system solved on Jugene had more than one trillion unknowns.
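The abstract does not give the form of the performance and communication model, so the following is only a toy illustration of how such a model can be used to pick a node count for a fixed problem size. It assumes, purely for illustration, that compute time shrinks in proportion to the number of nodes while communication time grows with log2 of the node count. The function names and the constants `t_per_unknown` and `t_comm_per_level` are hypothetical and are not taken from the article or from HHG.

```python
import math


def predicted_runtime(nodes, unknowns, t_per_unknown, t_comm_per_level):
    """Toy strong-scaling model (an assumption, not the article's model).

    compute:       work is split evenly across nodes
    communication: cost grows with log2(nodes)
    """
    compute = unknowns * t_per_unknown / nodes
    communication = t_comm_per_level * math.log2(nodes)  # log2(1) == 0
    return compute + communication


def best_node_count(unknowns, candidates,
                    t_per_unknown=1e-8, t_comm_per_level=1e-3):
    """Return the candidate node count with the smallest predicted runtime."""
    return min(
        candidates,
        key=lambda n: predicted_runtime(
            n, unknowns, t_per_unknown, t_comm_per_level
        ),
    )


if __name__ == "__main__":
    # Hypothetical candidates: powers of two from 1 to 131072 nodes.
    node_counts = [2 ** k for k in range(18)]
    for size in (1e9, 1e12):
        n = best_node_count(size, node_counts)
        print(f"unknowns={size:.0e}: predicted optimum at {n} nodes")
```

Under these assumed constants the predicted optimum moves to larger node counts as the problem size grows, which is the qualitative behaviour the abstract describes. A real model would replace both terms with measured machine parameters.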
