首页> 外文会议>International Euro-Par Conference on Parallel Processing >TA UoverSupermon: Low-Overhead Online Parallel Performance Monitoring
【24h】

TA UoverSupermon: Low-Overhead Online Parallel Performance Monitoring

机译:TA UOVESUPERMON:低开销在线并行性能监控

获取原文

摘要

Online application performance monitoring allows tracking performance characteristics during execution as opposed to doing so post-mortem. This opens up several possibilities otherwise unavailable such as real-time visualization and application performance steering that can be useful in the context of long-running applications. As HPC systems grow in size and complexity, the key challenge is to keep the online performance monitor scalable and low overhead while still providing a useful performance reporting capability. Two fundamental components that constitute such a performance monitor are the measurement and transport systems. We adapt and combine two existing, mature systems -^sTAU and Supermon - to address this problem. TAU performs the measurement while Supermon is used to collect the distributed measurement state. Our experiments show that this novel approach leads to very low-overhead application monitoring as well as other benefits unavailable from using a transport such as NFS.
机译:在线应用程序性能监控允许在执行期间跟踪性能特征,而不是执行此类后验证。这使得若干可能性不可用,例如实时可视化和应用程序性能转向,这在长期运行的应用程序的上下文中可以是有用的。随着HPC系统的大小和复杂性,关键挑战是保持在线性能监视器可扩展和低开销,同时仍提供有用的性能报告能力。构成这种性能监视器的两个基本组件是测量和运输系统。我们适应并结合两个现有,成熟的系统 - ^斯劳和超级举措 - 解决这个问题。 TAU执行测量,而超常MON用于收集分布式测量状态。我们的实验表明,这种新颖的方法导致非常低的应用程序监控以及使用诸如NFS等运输的其他益处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号