首页> 外文会议>IEEE/ACM International Symposium on Code Generation and Optimization >On performance debugging of unnecessary lock contentions on multicore processors: A replay-based approach
【24h】

On performance debugging of unnecessary lock contentions on multicore processors: A replay-based approach

机译:关于多核处理器上不必要的锁争用的性能调试:一种基于重播的方法

获取原文

摘要

Locks have been widely used as an effective synchronization mechanism among processes and threads. However, we observe that a large number of false inter-thread dependencies (i.e., unnecessary lock contentions) exist during the program execution on multicore processors, thereby incurring significant performance overhead. This paper presents a performance debugging framework, PerfPlay, to facilitate a comprehensive and in-depth understanding of the performance impact of unnecessary lock contentions. The core technique of our debugging framework is trace replay. Specifically, PerfPlay records the program execution trace, on the basis of which the unnecessary lock contentions can be identified through trace analysis. We then propose a novel technique of trace transformation to transform these identified unnecessary lock contentions in the original trace into the correct pattern as a new trace free of unnecessary lock contentions. Through replaying both traces, PerfPlay can quantify the performance impact of unnecessary lock contentions. To demonstrate the effectiveness of our debugging framework, we study five real-world programs and PARSEC benchmarks. Our experimental results demonstrate the significant performance overhead of unnecessary lock contentions, and the effectiveness of PerfPlay in identifying the performance critical unnecessary lock contentions in real applications.
机译:锁已被广泛用作进程和线程之间的有效同步机制。但是,我们观察到在多核处理器上执行程序的过程中存在大量错误的线程间依赖关系(即不必要的锁争用),从而导致显着的性能开销。本文介绍了一个性能调试框架PerfPlay,以帮助您全面而深入地了解不必要的锁争用对性能的影响。我们的调试框架的核心技术是跟踪重播。具体来说,PerfPlay记录程序执行跟踪,在此基础上可以通过跟踪分析来识别不必要的锁争用。然后,我们提出了一种跟踪转换的新技术,可以将原始跟踪中这些已标识的不必要的锁争用转换为正确的模式,作为没有不必要的锁争用的新跟踪。通过重播两条跟踪,PerfPlay可以量化不必要的锁争用对性能的影响。为了证明我们的调试框架的有效性,我们研究了五个实际程序和PARSEC基准。我们的实验结果证明了不必要的锁争用的显着性能开销,以及PerfPlay在识别实际应用程序中对性能至关重要的不必要的锁争用方面的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号