Journal: Scientific Programming

A performance tuning methodology with compiler support


Abstract

We have developed an environment, based upon robust, existing, open source software, for tuning applications written using MPI, OpenMP or both. The goal of this effort, which integrates the OpenUH compiler and several popular performance tools, is to increase user productivity by providing an automated, scalable performance measurement and optimization system. In this paper we describe our environment, show how these complementary tools can work together, and illustrate the synergies possible by exploiting their individual strengths and combined interactions. We also present a methodology for performance tuning that is enabled by this environment. One of the benefits of using compiler technology in this context is that it can direct the performance measurements to capture events at different levels of granularity and help assess their importance, which we have shown to significantly reduce the measurement overheads. The compiler can also help when attempting to understand the performance results: it can supply information on how a code was translated and whether optimizations were applied. Our methodology combines two performance views of the application to find bottlenecks. The first is a high level view that focuses on OpenMP/MPI performance problems such as synchronization cost and load imbalances; the second is a low level view that focuses on hardware counter analysis with derived metrics that assess the efficiency of the code. Our experiments have shown that our approach can significantly reduce overheads for both profiling and tracing to acceptable levels and limit the number of times the application needs to be run with selected hardware counters. In this paper, we demonstrate the workings of this methodology by illustrating its use with selected NAS Parallel Benchmarks and a cloud resolving code.
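The two performance views described in the abstract can be illustrated with a small sketch. It is not taken from the paper or from OpenUH: the function names, the counter keys (`cycles`, `instructions`, `l2_misses`, `l2_accesses`) and the specific formulas are assumptions chosen only to show what a high-level load-imbalance figure and low-level derived hardware-counter metrics might look like.

```python
from statistics import mean


def load_imbalance(times):
    """High-level view (assumed formula): (max - mean) / max over
    per-thread or per-rank times. 0.0 means perfectly balanced."""
    longest = max(times)
    if longest == 0:
        return 0.0
    return (longest - mean(times)) / longest


def derived_metrics(counters):
    """Low-level view (assumed metrics): combine raw hardware counter
    values into ratios that hint at code efficiency.
    The dictionary keys are hypothetical, not real counter names."""
    return {
        # instructions completed per cycle
        "ipc": counters["instructions"] / counters["cycles"],
        # fraction of L2 accesses that missed
        "l2_miss_rate": counters["l2_misses"] / counters["l2_accesses"],
    }


if __name__ == "__main__":
    # Hypothetical timings (seconds) for four threads in one parallel region.
    print(load_imbalance([2.0, 2.0, 2.0, 6.0]))
    # Hypothetical counter readings for the same region.
    print(derived_metrics({
        "cycles": 1000,
        "instructions": 1500,
        "l2_misses": 50,
        "l2_accesses": 200,
    }))
```

A region with a high imbalance value would be a candidate for the high-level analysis (synchronization cost, work distribution), while a low instructions-per-cycle value or high miss rate would point toward the low-level, hardware-counter analysis.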
