首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >Clock delta compression for scalable order-replay of non-deterministic parallel applications
【24h】

Clock delta compression for scalable order-replay of non-deterministic parallel applications

机译:Clock Delta压缩以获得可扩展的顺序 - 重放非确定性并行应用

获取原文

摘要

The ability to record and replay program execution helps significantly in debugging non-deterministic MPI applications by reproducing message-receive orders. However, the large amount of data that traditional record-and-reply techniques record precludes its practical applicability to massively parallel applications. In this paper, we propose a new compression algorithm, Clock Delta Compression (CDC), for scalable record and replay of non-deterministic MPI applications. CDC defines a reference order of message receives based on a totally ordered relation using Lamport clocks, and only records the differences between this reference logical-clock order and an observed order. Our evaluation shows that CDC significantly reduces the record data size. For example, when we apply CDC to Monte Carlo particle transport Benchmark (MCB), which represents common non-deterministic communication patterns, CDC reduces the record size by approximately two orders of magnitude compared to traditional techniques and incurs between 13.1% and 25.5% of runtime overhead.
机译:记录和重播程序执行的能力有助于通过再现消息接收订单来调试非确定性MPI应用程序。然而,传统的记录和回复技术记录的大量数据排除了其对大规模平行应用的实际适用性。在本文中,我们提出了一种新的压缩算法,时钟Δ压缩(CDC),可用于缩放记录和重放非确定性MPI应用程序。 CDC定义了基于使用Lamport时钟的完全有序关系的消息接收的参考顺序,并且只记录该参考逻辑时钟顺序和观察顺序之间的差异。我们的评估表明,CDC显着降低了记录数据大小。例如,当我们将CDC应用于Monte Carlo粒子传输基准(MCB)时,表示常见的非确定性通信模式,与传统技术相比,CDC减少了大约两个数量级的记录大小,并且引发了13.1%和25.5%运行时开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号