Parallel CFD 2002 Conference, May 20-22, 2002, Kansai Science City, Japan

PARALLEL ARCHITECTURE AND ITS PERFORMANCE OF OCEANIC GLOBAL CIRCULATION MODEL BASED ON MOM3 TO BE RUN ON THE EARTH SIMULATOR

Abstract

In this study, we present the latest results from evaluating OFES, our computationally optimized code based on MOM3, for running on the Earth Simulator. An O(10)-year integration at a horizontal resolution of 0.1 degree will be one of the first attempts at a scientific simulation of this scale. To preserve the flexibility of MOM3 from a scientific point of view, we consider two types of parallel architecture, chosen according to the resolution needed to represent the oceanic phenomena of interest. The first, for relatively low-resolution phenomena with longer integration times, uses the shared-memory system to improve parallel performance within a single node composed of 8 PEs. To achieve the most efficient parallel computation inside a node, we modified the MPI library into an assembly-coded library. The second, for the ultra-high-resolution case of 0.1 degree in the horizontal, communicates only through the MPI library, which makes no distinction between communication inside and outside a node. In this case, we took the amount of computation in the halo region into account in order to attain high parallel performance. As a result, the code achieved a computational speed roughly 500 times higher than the CPU time on a single node would give. No load imbalance was observed. In this paper, we describe the optimization strategy for both cases to attain the target performance, together with results measured on the Earth Simulator. Experiments for the ultra-high-resolution case were carried out using 188 nodes, comprising 1500 PEs.
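The abstract does not say how OFES treats its halo region, so the following is only a generic sketch of the trade-off it alludes to: a wider halo lets each subdomain take several stencil steps between exchanges, at the cost of redundant computation in the halo cells. The 1-D three-point averaging stencil, the periodic domain, and the names `step`, `serial`, and `decomposed` are all assumptions made for illustration; none of them come from MOM3 or OFES, and the "exchange" is simulated by list copies rather than real MPI calls.

```python
def step(u):
    """One 3-point averaging step; the result is 2 cells shorter than u."""
    return [(u[i - 1] + u[i] + u[i + 1]) / 3.0 for i in range(1, len(u) - 1)]


def serial(u, nsteps):
    """Reference solution: nsteps stencil steps on a periodic 1-D domain."""
    n = len(u)
    for _ in range(nsteps):
        u = [(u[(i - 1) % n] + u[i] + u[(i + 1) % n]) / 3.0 for i in range(n)]
    return u


def decomposed(u, nparts, halo, nsteps):
    """Hypothetical block decomposition with a halo of width `halo`.

    Each block copies `halo` cells from each neighbour (the simulated
    exchange), then takes `halo` local steps before the next exchange.
    Returns the field, the number of exchanges, and the number of cell updates.
    """
    n = len(u)
    assert n % nparts == 0 and nsteps % halo == 0
    size = n // nparts
    exchanges = 0
    updates = 0
    for _ in range(nsteps // halo):
        new = []
        for p in range(nparts):
            lo = p * size
            # Owned cells plus `halo` cells on each side (periodic wrap).
            local = [u[(lo + k) % n] for k in range(-halo, size + halo)]
            exchanges += 1
            for _ in range(halo):
                local = step(local)      # shrinks by one cell on each side
                updates += len(local)    # includes redundant halo updates
            new.extend(local)            # exactly `size` owned cells remain
        u = new
    return u, exchanges, updates


if __name__ == "__main__":
    u0 = [float(i * i % 7) for i in range(16)]
    ref = serial(u0, 4)
    for h in (1, 2):
        out, ex, up = decomposed(u0, nparts=4, halo=h, nsteps=4)
        err = max(abs(a - b) for a, b in zip(ref, out))
        print(f"halo={h}: exchanges={ex}, cell updates={up}, max error={err}")
```

In this toy setting, doubling the halo width halves the number of exchanges while adding extra cell updates, which is the balance between communication and halo computation that a pure-MPI configuration has to weigh.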