
Performance Analysis of a Hybrid Overset Multi-Block Application on Multiple Architectures

Abstract

This paper presents a detailed performance analysis of a multi-block overset grid computational fluid dynamics application on multiple state-of-the-art computer architectures. The application is implemented using a hybrid MPI+OpenMP programming paradigm that exploits both coarse- and fine-grain parallelism: the former via MPI message passing and the latter via OpenMP directives. The hybrid model also extends the applicability of multi-block programs to large clusters of SMP nodes by overcoming the restriction that the number of processors be less than the number of grid blocks. A key kernel of the application, namely the LU-SGS linear solver, had to be modified to enhance the performance of the hybrid approach on the target machines. Investigations were conducted on cacheless Cray SX6 vector processors, cache-based IBM Power3 and Power4 architectures, and single-system-image SGI Origin3000 platforms. Overall results for complex vortex dynamics simulations demonstrate that the SX6 achieves the highest performance and outperforms the RISC-based architectures; however, the best scaling performance was achieved on the Power3.
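
The processor-count restriction mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `hybrid_layout`, the round-robin block ownership, and the even split of processors are assumptions made only for illustration. The idea shown is that a pure message-passing code can use at most one process per grid block, while a hybrid code can put the remaining processors to work as threads inside each process.

```python
def hybrid_layout(n_blocks, n_procs):
    """Hypothetical helper: split n_procs into MPI ranks and threads per rank.

    Returns (n_ranks, threads_per_rank, owner), where owner maps each
    grid block index to the rank that holds it.
    """
    if n_blocks < 1 or n_procs < 1:
        raise ValueError("n_blocks and n_procs must be positive")

    # Coarse grain: at most one MPI rank per grid block.
    n_ranks = min(n_blocks, n_procs)

    # Fine grain: processors beyond the block count become OpenMP-style
    # threads inside each rank (integer division; any remainder is unused).
    threads_per_rank = n_procs // n_ranks

    # Assumed round-robin ownership; a real code would balance by block size.
    owner = {block: block % n_ranks for block in range(n_blocks)}
    return n_ranks, threads_per_rank, owner


if __name__ == "__main__":
    # 4 grid blocks on 16 processors: 4 ranks, each running 4 threads.
    print(hybrid_layout(4, 16))
```

With 4 blocks and 16 processors, a block-per-process layout would leave 12 processors idle, whereas the sketch assigns 4 threads to each of the 4 ranks. When there are more blocks than processors, it falls back to one thread per rank and several blocks per rank.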
