首页> 外文OA文献 >gamer-2: a GPU-accelerated adaptive mesh refinement code – accuracy, performance, and scalability
【2h】

gamer-2: a GPU-accelerated adaptive mesh refinement code – accuracy, performance, and scalability

机译:Gamer-2:GPU加速的自适应网格精制码 - 精度,性能和可扩展性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We present GAMER-2, a GPU-accelerated adaptive mesh refinement (AMR) code forastrophysics. It provides a rich set of features, including adaptivetime-stepping, several hydrodynamic schemes, magnetohydrodynamics,self-gravity, particles, star formation, chemistry and radiative processes withGRACKLE, data analysis with yt, and memory pool for efficient objectallocation. GAMER-2 is fully bitwise reproducible. For the performanceoptimization, it adopts hybrid OpenMP/MPI/GPU parallelization and utilizes thesimultaneity between CPU computation, GPU computation, and CPU-GPUcommunication. Load balancing is achieved using a Hilbert space-filling curveon a level-by-level basis without the need to duplicate the entire AMRhierarchy on each MPI process. To provide convincing demonstrations of theaccuracy and performance of GAMER-2, we directly compare with Enzo on isolateddisk galaxy simulations and with FLASH on galaxy cluster merger simulations. Weshow that the physical results obtained by different codes are in very goodagreement, and GAMER-2 outperforms Enzo and FLASH by nearly one and two ordersof magnitude, respectively, on the Blue Waters supercomputers using $1-256$nodes. More importantly, GAMER-2 exhibits similar or even better parallelscalability compared to the other two codes. We also demonstrate good weak andstrong scaling using up to 4096 GPUs and 65,536 CPU cores, and achieve auniform resolution as high as $10{,}240^3$ cells. Furthermore, GAMER-2 can beadopted as an AMR+GPUs framework and has been extensively used for the wavedark matter ($psi$DM) simulations. GAMER-2 is open source (available athttps://github.com/gamer-project/gamer) and new contributions are welcome.
机译:我们展示了Gamer-2,一个GPU加速的自适应网格细化(AMR)代码饰品。它提供了丰富的特征,包括适应性步进,多种流体动力学,磁力流体动力学,自重,粒子,星形形成,化学和辐射过程,与YT的数据分析,以及用于高效目标的内存池。 Gamer-2是完全编辑的。对于PerformanceOptimization,它采用混合型OpenMP / MPI / GPU并行化,并利用CPU计算,GPU计算和CPU-GPucocomation之间的杂志。使用HILBERT Space-Filling Curveon实现逐级级别的负载平衡,而无需在每个MPI过程上复制整个AMRIERARCH。为了提供令人信服的TheAccuracy和Gamer-2表演的演示,我们直接与Enzo相比,在隔离的星系模拟和Galaxy Cluster合并模拟中使用Flash。 WELEWERY CODES获得的物理结果在非常适合于使用$ 1-256 $节点的蓝色水上计算机超级计算机上分别非常代码,并且游戏玩家-2优于eNZO和闪存,分别通过近一个和两个数量幅度。更重要的是,与另外两种代码相比,Gamer-2展现了类似甚至更好的平价能力。我们还展示了使用多达4096个GPU和65,536个CPU核心的良好弱和激奏扩展,并实现了高达10美元{,} 240 ^ $细胞。此外,Gamer-2可以作为AMR + GPU框架Beadopted,并已广泛用于WaveDark物质($ PSI $ DM)模拟。 Gamer-2是开源的(可用的Athttps://github.com/gamer-project/gamer)和欢迎新贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号