Deterministic Message Passing for Distributed Parallel Computing

Abstract

The nondeterminism of message-passing communication complicates program debugging, testing, and fault tolerance. This paper proposes a novel deterministic message-passing implementation (DMPI) for parallel programs in distributed environments. DMPI exposes a user interface compatible with standard MPI and guarantees reproducible message delivery with high performance. The basic idea of DMPI is to use logical time to resolve message races and control asynchronous transmissions, thereby eliminating the nondeterministic behavior of existing message-passing mechanisms. We apply a buffering strategy to alleviate the slowdown caused by the mismatch between logical time and physical time. Because deterministic mechanisms can introduce deadlocks, we also integrate DMPI with a lightweight deadlock checker that dynamically detects and resolves them. We have implemented DMPI and evaluated it using the NPB benchmarks. The results show that DMPI guarantees determinism while incurring modest runtime overhead (14% on average).
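
The abstract does not give implementation details, so the following is only a minimal Python sketch of the general idea of resolving a message race with logical time. It is not DMPI's actual algorithm. All names (`Message`, `DeterministicMailbox`, `replay`) are hypothetical, and the sketch assumes that each sender's messages arrive in the order sent and carry increasing logical timestamps. A wildcard receive delivers the buffered message with the smallest (logical time, sender rank) pair, and it stalls while any sender that is still open has nothing buffered, since that sender might yet produce an earlier message.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    logical_time: int  # sender's logical clock at send time (assumed field)
    source: int        # sender rank, used to break timestamp ties
    payload: str


class DeterministicMailbox:
    """Hypothetical receiver-side buffer that delivers by logical time."""

    def __init__(self, sources):
        # One FIFO queue per sender; assumes per-sender arrival order is kept.
        self.pending = {s: deque() for s in sources}
        self.closed = set()

    def arrive(self, msg):
        self.pending[msg.source].append(msg)

    def close(self, source):
        # Marks a sender as finished so that it no longer blocks delivery.
        self.closed.add(source)

    def receive_any(self):
        # Stall while an open sender has nothing buffered: it could still
        # send a message with a smaller logical time than the ones we hold.
        if any(not q and s not in self.closed for s, q in self.pending.items()):
            return None
        heads = [q[0] for q in self.pending.values() if q]
        if not heads:
            return None
        winner = min(heads, key=lambda m: (m.logical_time, m.source))
        return self.pending[winner.source].popleft()


def replay(arrival_order, sources):
    """Feeds messages in a given physical arrival order; returns delivery order."""
    box = DeterministicMailbox(sources)
    delivered = []

    def drain():
        while (m := box.receive_any()) is not None:
            delivered.append(m.payload)

    for msg in arrival_order:
        box.arrive(msg)
        drain()
    for s in sources:
        box.close(s)
        drain()
    return delivered


a, d = Message(1, 0, "a"), Message(4, 0, "d")  # sent by rank 0
b, c = Message(2, 1, "b"), Message(3, 1, "c")  # sent by rank 1

# Two different physical interleavings of the same sends.
run1 = replay([a, d, b, c], sources=[0, 1])
run2 = replay([b, c, a, d], sources=[0, 1])
print(run1 == run2)
```

In this sketch the stall in `receive_any` is what makes the order reproducible, and it is also where waiting costs and potential blocking come from, which is consistent with the abstract's mention of a buffering strategy and a deadlock checker.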

Bibliographic details

  • Source
    IEICE Transactions on Information and Systems, 2013, No. 5, pp. 1068-1077 (10 pages)
  • Author affiliation

    School of Computer, National University of Defense Technology, Hunan, 410073, P.R. China

  • Indexed in: Science Citation Index (SCI); Engineering Index (EI)
  • Original format: PDF
  • Language: English
  • Keywords

    determinism; message-passing; debugging; distributed computing
