首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >Clock delta compression for scalable order-replay of non-deterministic parallel applications
【24h】

Clock delta compression for scalable order-replay of non-deterministic parallel applications

机译:时钟增量压缩可用于不确定性并行应用程序的可扩展顺序重放

获取原文

摘要

The ability to record and replay program execution helps significantly in debugging non-deterministic MPI applications by reproducing message-receive orders. However, the large amount of data that traditional record-and-reply techniques record precludes its practical applicability to massively parallel applications. In this paper, we propose a new compression algorithm, Clock Delta Compression (CDC), for scalable record and replay of non-deterministic MPI applications. CDC defines a reference order of message receives based on a totally ordered relation using Lamport clocks, and only records the differences between this reference logical-clock order and an observed order. Our evaluation shows that CDC significantly reduces the record data size. For example, when we apply CDC to Monte Carlo particle transport Benchmark (MCB), which represents common non-deterministic communication patterns, CDC reduces the record size by approximately two orders of magnitude compared to traditional techniques and incurs between 13.1% and 25.5% of runtime overhead.
机译:记录和重播程序执行的能力通过重现消息接收顺序,极大地帮助调试不确定的MPI应用程序。但是,传统的记录和答复技术记录的大量数据妨碍了其对大规模并行应用程序的实际适用性。在本文中,我们提出了一种新的压缩算法,时钟增量压缩(CDC),用于可扩展性的记录和非确定性MPI应用程序的重放。 CDC使用Lamport时钟基于完全有序的关系定义消息接收的参考顺序,并且仅记录此参考逻辑时钟顺序与观察到的顺序之间的差异。我们的评估表明CDC大大减少了记录数据的大小。例如,当我们将CDC应用于代表通用非确定性通信模式的蒙特卡洛粒子传输基准(MCB)时,与传统技术相比,CDC将记录大小减少了大约两个数量级,并且导致CDC减少了13.1%至25.5%。运行时开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号