...
首页> 外文期刊>International Journal of Computational Science and Engineering >Porting the MPI-parallelised LES model PALM to multi-GPU systems and many integrated core processors - an experience report
【24h】

Porting the MPI-parallelised LES model PALM to multi-GPU systems and many integrated core processors - an experience report

机译:将MPI平行化的LES模型手掌移植到多GPU系统和许多集成核心处理器 - 体验报告

获取原文
获取原文并翻译 | 示例

摘要

The computational power and availability of graphics processing units (GPUs) and many integrated core (MIC) processors on high performance computing (HPC) systems is rapidly evolving. However, HPC applications need to be ported to take advantage of such hardware. This paper is a report on our experience of porting the MPI+OpenMP parallelised large-eddy simulation model (PALM) to multi-GPU as well as to MIC processor environments using OpenACC and OpenMP. PALM is written in Fortran, entails 140 kLOC and runs on HPC farms of up to 43,200 cores. The main porting challenges are the size and complexity of PALM, its inconsistent modularisation and no unit-tests. We report the methods used to identify performance issues as well as our experiences with state-of-the-art profiling tools. Moreover, we outline the required porting steps, describe the problems and bottlenecks we encountered and present separate performance tests for both architectures. We however, do not provide benchmark information.
机译:高性能计算(HPC)系统上的图形处理单元(GPU)和许多集成核心(MIC)处理器的计算能力和可用性正在快速发展。但是,需要移植HPC应用程序以利用此类硬件。本文是我们使用OpenACC和OpenMP将MPI + OpenMP并行大型仿真模型(Palm)移植到多GPU以及MIC处理器环境的经验的报告。 Palm是用Fortran编写的,需要140 kloc,并在高达43,200个核心的HPC农场上运行。主要的移植挑战是手掌的大小和复杂性,其不一致的模块化和没有单位测试。我们报告用于识别绩效问题的方法以及我们最先进的分析工具的经验。此外,我们概述了所需的移植步骤,描述我们遇到的问题和瓶颈,并为这两个架构提供单独的性能测试。但是,我们不提供基准信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号