首页> 外文OA文献 >Profilage et débogage par prise de traces efficaces d'applications hybrides multi-threadées HPC
【2h】

Profilage et débogage par prise de traces efficaces d'applications hybrides multi-threadées HPC

机译:通过高效跟踪多线程混合HPC应用程序进行性能分析和调试

摘要

Supercomputers’ evolution is at the source of both hardware and software challenges. In the quest for the highest computing power, the interdependence in-between simulation components is becoming more and more impacting, requiring new approaches. This thesis is focused on the software development aspect and particularly on the observation of parallel software when being run on several thousand cores. This observation aims at providing developers with the necessary feedback when running a program on an execution substrate which has not been modeled yet because of its complexity. In this purpose, we firstly introduce the development process from a global point of view, before describing developer tools and related work. In a second time, we present our contribution which consists in a trace based profiling and debugging tool and its evolution towards an on-line coupling method which as we will show is more scalable as it overcomes IOs limitations. Our contribution also covers our time-stamp synchronisation algorithm for tracing purposes which relies on a probabilistic approach with quantified error. We also present a tool allowing machine characterisation from the MPI aspect and demonstrate the presence of machine noise for both point to point and collectives, justifying the use of an empirical approach. In summary, this work proposes and motivates an alternative approach to trace based event collection while preserving event granularity and a reduced overhead
机译:超级计算机的发展是硬件和软件挑战的根源。为了寻求最高的计算能力,仿真组件之间的相互依赖性正在变得越来越重要,需要新的方法。本文主要关注软件开发方面,尤其是在数千个内核上运行时对并行软件的观察。该观察旨在为开发人员在执行基板上运行程序时提供必要的反馈,该程序由于其复杂性尚未进行建模。为此,在描述开发人员工具和相关工作之前,我们首先从全局的角度介绍开发过程。第二次,我们介绍了我们的贡献,其中包括基于跟踪的性能分析和调试工具,以及它向在线耦合方法的发展,正如我们将展示的那样,由于克服了IO的局限性,它具有更大的可扩展性。我们的贡献还包括用于跟踪目的的时间戳同步算法,该算法依赖于带有量化误差的概率方法。我们还提供了一种工具,可以从MPI方面进行机械特征分析,并演示点对点和总体机械噪声的存在,证明使用经验方法是合理的。总而言之,这项工作提出并激发了一种替代方法来跟踪基于事件的事件,同时保留了事件的粒度和减少的开销

著录项

  • 作者

    Besnard Jean-Baptiste;

  • 作者单位
  • 年度 2014
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号