AIAA Aerospace Sciences Meeting; AIAA SciTech Forum

Enabling Performance Optimization of CFD Applications with Source-to-Source Compiler Technology

Abstract

This paper presents a new programming approach that applies source-to-source compiler technology to parallelize a computational fluid dynamics (CFD) flow solver. As modern computer architectures grow more complex, it is becoming increasingly difficult for programmers to optimize large-scale applications for them, especially when an application relies on large amounts of legacy code. Instead of tediously hand-optimizing the parallel implementation for each new architecture, translators built on a source-to-source compiler are designed to generate that implementation. The translators allow custom programming models to be constructed; these custom models are more effective on complex architectures than general-purpose programming models because they can incorporate more domain-specific knowledge. We apply this concept to a multi-stage multigrid finite-volume algorithm. The translator, written with the ROSE compiler, replaces sequences of instructions in the serial algorithm with QUARK tasks. QUARK is a dynamic runtime system that executes tasks asynchronously while avoiding data hazards. A dynamically scheduled, task-based parallel method makes it easier to load-balance complex algorithms on complex computer architectures. The overall strategy is shown to increase the performance, programmability, and portability of CFD applications.
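The idea of executing tasks asynchronously while avoiding data hazards can be illustrated with a small sketch. The Python code below is not the QUARK API and not the paper's translator output. It is a toy model, under the assumption that each task declares which data it reads and writes, showing how a runtime could infer read-after-write, write-after-read, and write-after-write dependencies and then run independent tasks concurrently. The class `TaskGraph`, its methods, and the `a`/`b`/`flux` update steps are all hypothetical names chosen for illustration, and the scheduler runs tasks in simple waves rather than fully asynchronously.

```python
from concurrent.futures import ThreadPoolExecutor


class TaskGraph:
    """Toy dependency tracker (hypothetical; not the QUARK API)."""

    def __init__(self):
        self.tasks = []          # task functions, in insertion (serial) order
        self.deps = []           # deps[i] = indices task i must wait for
        self._last_writer = {}   # data name -> index of the last writer
        self._readers = {}       # data name -> readers since the last write

    def insert(self, func, reads=(), writes=()):
        """Register a task and derive its dependencies from data accesses."""
        idx = len(self.tasks)
        deps = set()
        for name in reads:
            # Read-after-write: wait for the task that produced the value.
            if name in self._last_writer:
                deps.add(self._last_writer[name])
        for name in writes:
            # Write-after-write: keep writes in serial order.
            if name in self._last_writer:
                deps.add(self._last_writer[name])
            # Write-after-read: do not overwrite data still being read.
            deps.update(self._readers.get(name, ()))
        for name in reads:
            self._readers.setdefault(name, []).append(idx)
        for name in writes:
            self._last_writer[name] = idx
            self._readers[name] = []
        self.tasks.append(func)
        self.deps.append(deps)
        return idx

    def run(self, workers=4):
        """Run tasks in waves; tasks within one wave have no mutual hazards."""
        done, waves = set(), []
        with ThreadPoolExecutor(max_workers=workers) as pool:
            while len(done) < len(self.tasks):
                ready = [i for i in range(len(self.tasks))
                         if i not in done and self.deps[i] <= done]
                list(pool.map(lambda i: self.tasks[i](), ready))
                done.update(ready)
                waves.append(ready)
        return waves


# Hypothetical two-cell update standing in for one stage of a solver.
data = {"a": 1.0, "b": 2.0, "flux": 0.0}


def compute_flux():
    data["flux"] = data["b"] - data["a"]


def update_a():
    data["a"] += 0.5 * data["flux"]


def update_b():
    data["b"] -= 0.5 * data["flux"]


graph = TaskGraph()
graph.insert(compute_flux, reads=["a", "b"], writes=["flux"])
graph.insert(update_a, reads=["flux", "a"], writes=["a"])
graph.insert(update_b, reads=["flux", "b"], writes=["b"])
graph.insert(compute_flux, reads=["a", "b"], writes=["flux"])
order = graph.run()
print(order, data)
```

In this sketch the two cell updates land in the same wave because neither touches data the other writes, while the second flux computation waits for both. A production runtime would release each task as soon as its own dependencies finish instead of waiting for a whole wave.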