首页> 外文会议> >Benchmark of parallelization methods for unstructured shock capturing code
【24h】

Benchmark of parallelization methods for unstructured shock capturing code

机译:非结构化震动捕获代码的并行化方法基准

获取原文
获取外文期刊封面目录资料

摘要

This paper presents benchmark results of three different parallel-programming paradigms on an unstructured shock capturing numerical code for transient problems. The three parallel programming methods include: (1) a shared-memory programming of OpenMP using cache coherent non-uniform memory access (CC-NUMA) of SGI Origin2000 (2) an MPI (Message Passing Interface) implementation and (3) a SHMEM implementation using the parallel library called "Shared Memory Access Library". The methods (2) and (3) are both based on distributed memory architecture. SGI Origin2000 is used throughout the current study. It is found that the scalability of the programming (1) is so poor that its usage for the unstructured CFD code is impractical. The scalabilities of programming (2) and (3) are much better than programming (1) and the computational speed of giga-flops range can be achieved with 16 CPUs. The parallel programming with SHMEM libraries is approximately twice as fast as the one with MPI.
机译:本文介绍了三种非并行编程范例的基准结果,这些范例是针对非结构化瞬变问题的冲击捕捉数字代码的。三种并行编程方法包括:(1)使用SGI Origin2000的缓存一致性非均匀内存访问(CC-NUMA)的OpenMP共享内存编程(2)MPI(消息传递接口)实现和(3)SHMEM使用称为“共享内存访问库”的并行库实现。方法(2)和(3)都基于分布式内存体系结构。在当前的研究中都使用了SGI Origin2000。发现编程(1)的可伸缩性非常差,以至于将其用于非结构化CFD代码是不切实际的。编程(2)和(3)的可扩展性比编程(1)更好,并且千兆位范围的计算速度可以用16个CPU来实现。使用SHMEM库进行并行编程的速度大约是使用MPI进行并行编程的速度的两倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号