Conference: IEEE International Symposium on Parallel and Distributed Processing

Profiling Methodology and Performance Tuning of the Met Office Unified Model for Weather and Climate Simulations


Abstract

Global weather and climate modelling is a compute-intensive task that is mission-critical to government departments concerned with meteorology and climate change. The dominant component of these models is a global atmosphere model. One such model, the Met Office Unified Model (MetUM), is widely used in both Europe and Australia for this purpose. This paper describes our experiences in developing an efficient profiling methodology and scalability analysis of the MetUM version 7.5 at both low-scale and high-scale atmosphere grid resolutions. Variability within the execution of the MetUM and variability of the run-time of identical jobs on a highly shared cluster are taken into account. The methodology uses a lightweight profiler internal to the MetUM, which we have enhanced to have minimal overhead and which enables accurate profiling with only a relatively modest usage of processor time. At high-scale resolution, the MetUM scaled to core counts of 2048, with load imbalance accounting for a significant fraction of the loss from ideal performance. Recent patches have removed two relatively small sources of inefficiency. Adjusting internal segment size parameters gave a modest performance improvement at low-scale resolution (such as is used in climate simulation); this, however, was not significant at higher scales. Near-square process grid configurations tended to give the best performance. Byte-swapping optimizations vastly improved I/O performance, which in turn has a large impact on performance in operational runs.
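
As an illustration of the remark on near-square process grids, the following minimal sketch enumerates the two-dimensional factorizations of a core count and picks the one closest to square. It is not taken from the paper or from the MetUM: the function names `process_grids` and `nearest_square_grid`, and the assumption that the grid is a plain two-factor decomposition of the core count, are hypothetical and chosen only for this example.

```python
import math


def process_grids(n_procs):
    """Return every (a, b) pair of positive integers with a * b == n_procs.

    Hypothetical helper: assumes the process grid is a simple
    two-dimensional factorization of the total core count.
    """
    grids = []
    for a in range(1, math.isqrt(n_procs) + 1):
        if n_procs % a == 0:
            b = n_procs // a
            grids.append((a, b))
            if a != b:
                grids.append((b, a))  # the transposed grid is a distinct layout
    return sorted(grids)


def nearest_square_grid(n_procs):
    """Pick the factor pair with the smallest difference between its sides.

    Ties (e.g. 32 x 64 versus 64 x 32) are broken by tuple order, which is
    an arbitrary choice made for this sketch.
    """
    return min(process_grids(n_procs), key=lambda g: (abs(g[0] - g[1]), g))


if __name__ == "__main__":
    for cores in (1024, 2048):
        print(cores, nearest_square_grid(cores))
```

For 2048 cores no exactly square grid exists, so the sketch returns a 32 x 64 layout. Since the abstract only says that near-square grids "tended to" perform best, a candidate chosen this way would still need to be confirmed by timing runs.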
