Published in: IEEE International Conference on High Performance Computing and Communications

Performance Analysis of CFD Application Cart3D Using MPInside and Performance Monitor Unit Data on Nehalem and Westmere Based Supercomputers

Abstract

Cart3D is a computational fluid dynamics (CFD) application aimed at conceptual and preliminary design of aerospace vehicles with complex geometries. It is widely used by design engineers at NASA, the Department of Defense, and aerospace companies in the USA. We present a detailed performance analysis of Cart3D using two tools: SGI MPInside, and op_scope, which collects hardware counter data from the Intel Performance Monitoring Unit (PMU) on supercomputers based on the Nehalem micro-architecture. Using these tools, we performed dynamic profiling of Cart3D (compute time, communication time, and I/O time), along with dynamic profiling of MPI functions (MPI_Sendrecv, MPI_Bcast, MPI_Isend, MPI_Irecv, MPI_Allreduce, MPI_Barrier, etc.) with respect to the message size of each rank and the time consumed by each function. MPI communication is further analyzed by studying the performance of the MPI functions used in this application as a function of message size and number of cores. Using these tools, we also studied the efficiency of the processor to measure its effective utilization, the efficiency of the floating-point units, the percentage of vectorization, and the percentage of data coming from L2 cache, L3 cache, and main memory. This study was performed on two computing sub-systems, based on quad-core Nehalem-EP and hex-core Westmere-EP processors, that are part of Pleiades, an SGI Altix ICE system at NASA Ames Research Center.
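
As a rough illustration of the kind of breakdown the abstract describes (compute, communication, and I/O time, plus MPI time per function and message size), the sketch below aggregates per-rank timing records in Python. The record layout, the function names `time_breakdown`, `size_bucket`, and `mpi_time_by_function_and_size`, and all numbers are hypothetical. They are not the output format of MPInside or op_scope, and they are not measurements from the paper.

```python
from collections import defaultdict

# Hypothetical per-rank records: (rank, category, mpi_function, message_bytes, seconds).
# All values are invented for illustration; they are NOT results from the paper
# and do not reflect the actual MPInside or op_scope output formats.
RECORDS = [
    (0, "compute", None, 0, 8.0),
    (0, "mpi", "MPI_Sendrecv", 65536, 1.5),
    (0, "mpi", "MPI_Allreduce", 8, 0.5),
    (1, "compute", None, 0, 7.0),
    (1, "mpi", "MPI_Sendrecv", 65536, 2.0),
    (1, "io", None, 0, 1.0),
]


def time_breakdown(records):
    """Return the fraction of total time spent in each category."""
    totals = defaultdict(float)
    for _rank, category, _func, _size, seconds in records:
        totals[category] += seconds
    grand_total = sum(totals.values())
    return {category: t / grand_total for category, t in totals.items()}


def size_bucket(nbytes):
    """Round a message size up to the nearest power of two (minimum 1)."""
    bucket = 1
    while bucket < nbytes:
        bucket *= 2
    return bucket


def mpi_time_by_function_and_size(records):
    """Sum MPI time over all ranks, keyed by (function, message-size bucket)."""
    table = defaultdict(float)
    for _rank, category, func, size, seconds in records:
        if category == "mpi":
            table[(func, size_bucket(size))] += seconds
    return dict(table)


if __name__ == "__main__":
    for category, fraction in sorted(time_breakdown(RECORDS).items()):
        print(f"{category:8s} {fraction:6.1%}")
    for (func, bucket), seconds in sorted(mpi_time_by_function_and_size(RECORDS).items()):
        print(f"{func:14s} <= {bucket:6d} B  {seconds:.2f} s")
```

Bucketing message sizes by powers of two is only one possible choice. It keeps the table small while still separating latency-bound small messages from bandwidth-bound large ones.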