首页> 外文会议>IEEE International Conference on Services Computing >Dynamic Job Replication for Balancing Fault Tolerance, Latency, and Economic Efficiency: Work in Progress
【24h】

Dynamic Job Replication for Balancing Fault Tolerance, Latency, and Economic Efficiency: Work in Progress

机译:动态作业复制以平衡容错,延迟和经济效率:正在进行的工作

获取原文

摘要

Recent research has demonstrated the benefits of replication of requests with canceling, which initiates multiple concurrent replicas of a request and uses the first successful result, immediately removing the remaining replicas of the completed request from the system. This paper suggests that the benefits of replication may come at the risk of an abrupt system transition to an undesirable highly congested equilibrium. To expose, evaluate, and ultimately manage these risk/benefit trade-offs, we generalize the replication strategy by: (a) accounting for the possible inefficiency of "remote" service, (b) allowing replication only when static routing fails to identify an idle "local" server, and (c) requiring one or more replicas of the same request to be completed to improve fault tolerance using a majority rule decision. Due to the intractability of the Markov performance model, our analysis is based on mean-field and fluid approximations. Future research should evaluate the accuracy of assertions based on these approximations, and ultimately develop practical solutions for optimization of various performance trade-offs in distributed systems with replication.
机译:最近的研究表明,取消复制请求的好处是,它可以发起一个请求的多个并发副本,并使用第一个成功的结果,立即从系统中删除已完成请求的其余副本。本文建议复制的好处可能会出现系统突然过渡到不良的高度拥塞平衡的风险。为了揭示,评估并最终管理这些风险/利益的折衷,我们通过以下方式归纳了复制策略:(a)考虑“远程”服务的可能效率低下,(b)仅在静态路由无法识别出一个闲置的“本地”服务器,以及(c)使用多数规则决策来完成同一请求的一个或多个副本以提高容错能力。由于马尔可夫性能模型的难处理性,我们的分析基于均值场和流体近似。未来的研究应基于这些近似值评估断言的准确性,并最终开发出实用的解决方案,以优化具有复制功能的分布式系统中的各种性能折衷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号