首页> 外文会议>International Workshop on Embedded Multicore Systems >Addressing Challenges in Visualizing Huge Call-Path Traces
【24h】

Addressing Challenges in Visualizing Huge Call-Path Traces

机译:解决可视化巨大呼叫路径迹线的挑战

获取原文

摘要

Analysis and optimization of long-running applications on large-scale parallel systems is important to avoid unacceptable inefficiencies. Tracing is one of the most popular techniques for understanding the performance of parallel programs. Since tracing captures data in the time dimension, the size of a trace is linearly proportional to execution time. For that reason, traces of long-running executions of parallel programs may contain gigabytes or even terabytes of data. Presenting huge traces in a scalable fashion and identifying performance bottlenecks hidden in an ocean of data are challenging problems. To pinpoint performance bottlenecks effectively, a performance visualization tool needs to be relatively responsive and scalable. It also needs to be able to present both a global view of the performance of all threads and processes in a parallel execution, and a local view to see the full detail of a trace for an individual thread or process. Our approach to address this challenge is to use a client-server approach for trace visualization in hpctraceviewer, which is part of the HPC-TOOLKIT performance tools. This paper demonstrates the utility of our tool for identifying performance bottlenecks in large-scale executions through case studies with two Department of Energy procurement benchmarks: Algebraic Multi Grid (AMG) and Unstructured Mesh Transport (UMT) codes. Finally, the experiment shows that our implementation is scalable, rendering views of huge traces stored on remote supercomputers in a few seconds.
机译:大规模并行系统上长时间运行应用的分析和优化对于避免不可接受的低效率非常重要。跟踪是了解并行程序性能的最流行的技术之一。由于追踪在时间尺寸中捕获数据,因此迹线的大小与执行时间线性成比例。因此,并行程序的长期执行执行的迹线可能包含千兆字节甚至数据的千兆字节。以可扩展的方式呈现巨大的迹线,并识别隐藏在数据海洋中的性能瓶颈是挑战性问题。为了有效地确定性能瓶颈,性能可视化工具需要相对响应和可扩展。它还需要能够在并行执行中呈现所有线程和进程的性能的全局视图,以及本地视图,以查看单个线程或过程的迹线的完整细节。我们解决这一挑战的方法是使用HPCtraceViewer中的跟踪可视化的客户端 - 服务器方法,这是HPC-Toolkit性能工具的一部分。本文展示了我们工具的效用,通过案例研究识别大规模执行中的性能瓶颈,与两个能量采购基准,代数多网格(AMG)和非结构化网格传输(UMT)代码。最后,实验表明,我们的实现是可扩展的,渲染在几秒钟内存储在远程超级计算机上的大规模迹线的视图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号