首页> 外文会议>International Conference on Parallel Processing Workshops >Addressing Challenges in Visualizing Huge Call-Path Traces
【24h】

Addressing Challenges in Visualizing Huge Call-Path Traces

机译:应对可视化巨大的呼叫路径跟踪中的挑战

获取原文

摘要

Analysis and optimization of long-running applications on large-scale parallel systems is important to avoid unacceptable inefficiencies. Tracing is one of the most popular techniques for understanding the performance of parallel programs. Since tracing captures data in the time dimension, the size of a trace is linearly proportional to execution time. For that reason, traces of long-running executions of parallel programs may contain gigabytes or even terabytes of data. Presenting huge traces in a scalable fashion and identifying performance bottlenecks hidden in an ocean of data are challenging problems. To pinpoint performance bottlenecks effectively, a performance visualization tool needs to be relatively responsive and scalable. It also needs to be able to present both a global view of the performance of all threads and processes in a parallel execution, and a local view to see the full detail of a trace for an individual thread or process. Our approach to address this challenge is to use a client-server approach for trace visualization in hpctraceviewer, which is part of the HPC-TOOLKIT performance tools. This paper demonstrates the utility of our tool for identifying performance bottlenecks in large-scale executions through case studies with two Department of Energy procurement benchmarks: Algebraic Multi Grid (AMG) and Unstructured Mesh Transport (UMT) codes. Finally, the experiment shows that our implementation is scalable, rendering views of huge traces stored on remote supercomputers in a few seconds.
机译:对大型并行系统上长期运行的应用程序进行分析和优化,对于避免效率低下非常重要。跟踪是了解并行程序性能的最流行的技术之一。由于跟踪在时间维度上捕获数据,因此跟踪的大小与执行时间成线性比例。因此,并行程序长时间运行的痕迹可能包含千兆字节甚至数TB的数据。以可伸缩的方式显示巨大的痕迹并识别隐藏在数据海洋中的性能瓶颈是具有挑战性的问题。为了有效地找出性能瓶颈,性能可视化工具需要相对敏感和可扩展。它还需要能够呈现并行执行中所有线程和进程的性能的全局视图,以及局部视图,以查看单个线程或进程的跟踪的全部详细信息。我们解决此挑战的方法是使用客户端服务器方法在hpctraceviewer中进行跟踪可视化,这是HPC-TOOLKIT性能工具的一部分。本文通过案例研究和能源部的两个采购基准:代数多网格(AMG)和非结构化网格传输(UMT)代码,展示了我们的工具在大规模执行中识别性能瓶颈的工具的实用性。最后,实验表明我们的实现是可伸缩的,可以在几秒钟内呈现存储在远程超级计算机上的大量跟踪的视图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号