...
首页> 外文期刊>Distributed Computing >Timestamping messages and events in a distributed system using synchronous communication
【24h】

Timestamping messages and events in a distributed system using synchronous communication

机译:使用同步通信在分布式系统中标记消息和事件的时间戳

获取原文
获取原文并翻译 | 示例

摘要

Determining order relationship between events of a distributed computation is a fundamental problem in distributed systems which has applications in many areas including debugging, visualization, checkpointing and recovery. Fidge/Mattern's vector-clock mechanism captures the order relationship using a vector of size N in a system consisting of N processes. As a result, it incurs message and space overhead of N integers. Many distributed applications use synchronous messages for communication. It is therefore natural to ask whether it is possible to reduce the timestamping overhead for such applications. In this paper, we present a new approach for timestamping messages and events of a synchronously ordered computation, that is, when processes communicate using synchronous messages. Our approach depends on decomposing edges in the communication topology into mutually disjoint edge groups such that each edge group either forms a star or a triangle. We show that, to accurately capture the order relationship between synchronous messages, it is sufficient to use one component per edge group in the vector instead of one component per process. Timestamps for events are only slightly bigger than timestamps for messages. Many common communication topologies such as ring, grid and hypercube can be decomposed into [N/2] edge groups, resulting in almost 50% improvement in both space and communication overheads. We prove that the problem of computing an optimal edge decomposition of a communication topology is NP-complete in general. We also present a heuristic algorithm for computing an edge decomposition whose size is within a factor of two of the optimal. We prove that, in the worst case, it is not possible to timestamp messages of a synchronously ordered computation using a vector containing fewer than 2[N/6] components when N ≥ 2. Finally, we show that messages in a synchronously ordered computation can always be timestamped in an offline manner using a vector of size at most [N/2].
机译:确定分布式计算的事件之间的顺序关系是分布式系统中的一个基本问题,分布式系统在许多领域都有应用,包括调试,可视化,检查点和恢复。 Fidge / Mattern的向量时钟机制在由N个进程组成的系统中使用大小为N的向量捕获顺序关系。结果,它导致N个整数的消息和空间开销。许多分布式应用程序使用同步消息进行通信。因此,自然会问到是否有可能减少此类应用程序的时间戳开销。在本文中,我们提出了一种对消息和事件进行时间戳的新方法,这些消息和事件属于同步排序的计算,即当进程使用同步消息进行通信时。我们的方法依赖于将通信拓扑中的边缘分解为互不相交的边缘组,以使每个边缘组形成星形或三角形。我们表明,要准确地捕获同步消息之间的顺序关系,在向量中的每个边缘组使用一个组件,而不是在每个进程中使用一个组件就足够了。事件的时间戳仅比消息的时间戳略大。许多常见的通信拓扑(例如环形,网格和超立方体)可以分解为[N / 2]个边缘组,从而在空间和通信开销方面几乎提高了50%。我们证明,计算通信拓扑的最佳边缘分解问题通常是NP完全的。我们还提出了一种启发式算法,用于计算边缘分解,其大小在最佳值的两倍之内。我们证明,在最坏的情况下,当N≥2时,不可能使用包含少于2 [N / 6]个分量的向量来为同步排序的计算消息加时间戳。最后,我们证明了同步排序的计算中的消息始终可以使用最大[N / 2]个大小的向量以离线方式加盖时间戳。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号