首页> 外文会议>AIAA modeling and simulation technologies conference;AIAA aviation forum >Parallelization of a Six Degree of Freedom Entry Vehicle Trajectory Simulation Using OpenMP and OpenACC
【24h】

Parallelization of a Six Degree of Freedom Entry Vehicle Trajectory Simulation Using OpenMP and OpenACC

机译:使用OpenMP和OpenACC的六自由度进​​入车辆轨迹仿真的并行化

获取原文

摘要

The art and science of writing parallelized software, using methods such as Open Multi-Processing (OpenMP) and Open Accelerators (OpenACC), is dominated by computer scientists. Engineers and non-computer scientists looking to apply these techniques to their project applications face a steep learning curve, especially when looking to adapt their original single threaded software to run multi-threaded on graphics processing units (GPUs). There are significant changes in mindset that must occur; such as how to manage memory, the organization of instructions, and the use of if statements (also known as branching). The purpose of this work is twofold: 1) to demonstrate the applicability of parallelized coding methodologies, OpenMP and OpenACC, to tasks outside of the typical large scale matrix mathematics; and 2) to discuss, from an engineer's perspective, the lessons learned from parallelizing software using these computer science techniques. This work applies OpenMP, on both multi-core central processing units (CPUs) and Intel® Xeon Phi™ 7210, and OpenACC on GPUs. These parallelization techniques are used to tackle the simulation of thousands of entry vehicle trajectories through the integration of six degree of freedom (DoF) equations of motion (EoM). The forces and moments acting on the entry vehicle, and used by the EoM, are estimated using multiple models of varying levels of complexity. Several benchmark comparisons are made on the execution of six DoF trajectory simulation: single thread Intel® Xeon® E5-2670 CPU, multi-thread CPU using OpenMP, multi-thread Xeon Phi™ 7210 using OpenMP, and multi-thread NVIDIA® Tesla® K40 GPU using OpenACC. These benchmarks are run on the Pleiades Supercomputer Cluster at the National Aeronautics and Space Administration (NASA) Ames Research Center (ARC), and a Xeon Phi™ 7210 node at NASA Langley Research Center (LaRC).
机译:使用诸如Open Multi-Processing(OpenMP)和Open Accelerators(OpenACC)之类的方法编写并行化软件的技术和科学受到计算机科学家的支配。希望将这些技术应用于其项目应用程序的工程师和非计算机科学家都面临着陡峭的学习曲线,尤其是当希望使其原始的单线程软件适应在图形处理单元(GPU)上运行多线程时。必须发生重大的观念变化;例如如何管理内存,指令的组织以及if语句的使用(也称为分支)。这项工作的目的是双重的:1)演示并行编码方法OpenMP和OpenACC对典型大规模矩阵数学之外的任务的适用性; 2)从工程师的角度讨论使用这些计算机科学技术并行化软件所获得的经验教训。这项工作将OpenMP应用于多核中央处理器(CPU)和Intel®Xeon Phi™7210,以及GPU上的OpenACC。这些并行化技术通过集成六个自由度(DoF)运动方程(EoM)来解决成千上万辆进入车辆轨迹的仿真问题。使用复杂程度不同的多个模型来估计由EoM使用的作用在入口车辆上的力和力矩。在执行六个DoF轨迹模拟时,进行了一些基准比较:单线程Intel®Xeon®E5-2670 CPU,使用OpenMP的多线程CPU,使用OpenMP的多线程Xeon Phi™7210和多线程NVIDIA®Tesla®使用OpenACC的K40 GPU。这些基准运行在美国国家航空航天局(NASA)艾姆斯研究中心(ARC)的Pleiades超级计算机集群上,以及NASA Langley研究中心(LaRC)的Xeon Phi™7210节点上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号