首页> 外文会议>ACM international conference on distributed event-based systems >Distributed Middleware Reliability and Fault Tolerance Support in System S
【24h】

Distributed Middleware Reliability and Fault Tolerance Support in System S

机译:系统S中分布式中间件可靠性和容错支持

获取原文

摘要

We describe a fault-tolerance technique for implementing operations in a large-scale distributed system that ensures that all the components will eventually have a consistent view of the system even in the face of component failures. To achieve this, we break the distributed operation into a series of smaller operations, each of which is local to a single component, carefully linked together. Thus, the effect of a component failure and restart in the middle of a multi-component operation is limited to that component and its immediate neighbors. This framework is used in System S, a commercial grade stream processing platform. In that context we will show empirically that our approach is effective and imposes low overhead on distributed inter-component operations.
机译:我们描述了一种用于在大规模分布式系统中实现操作的容错技术,其确保即使在组件故障面上也最终将具有一致的系统视图。为此,我们将分布式操作分解为一系列较小的操作,每个操作都是单个组件的本地,仔细连接在一起。因此,组件故障和重新启动在多分量操作的中间的效果仅限于该组件及其立即邻居。该框架用于系统S,商业级流处理平台。在这方面,我们将凭经验展示我们的方法是有效的,并对分布式组件间操作产生低开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号