Journal: Parallel Computing

On the efficacy, efficiency and emergent behavior of task replication in large distributed systems

Abstract

Large distributed systems challenge traditional schedulers, as it is often hard to determine a priori how long each task will take to complete on each resource, information that such schedulers require as input. Task replication has been applied in a variety of scenarios as a way to circumvent this problem. Task replication consists of dispatching multiple replicas of a task and using the result from the first replica to finish. Replication schedulers (i.e., schedulers that employ task replication) are able to achieve good performance even in the absence of information on tasks and resources. They are also less complex than traditional schedulers, which makes them better suited to large distributed systems. On the other hand, replication schedulers waste cycles on the replicas that are not the first to finish. Moreover, this extra consumption of resources raises severe concerns about the system-wide performance of a distributed system with multiple, competing replication schedulers. This paper presents a comprehensive study of task replication, comparing replication schedulers against traditional information-based schedulers, and establishing their efficacy (the performance delivered to the application), efficiency (the amount of resources wasted), and emergent behavior (the system-wide behavior of a system with multiple replication schedulers). We also introduce a simple access control strategy that can be implemented locally by each resource and greatly improves the overall performance of a system in which multiple replication schedulers compete for resources.
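
As a rough illustration of the replication idea described in the abstract, the following minimal Python sketch simulates a single task dispatched as several replicas, where the result of the first replica to finish is used. It is not the paper's scheduler. The function name `replicate`, the sample run times, and the way wasted time is counted (every losing replica is assumed to be cancelled the moment the winner finishes) are all assumptions made for this example.

```python
def replicate(runtimes):
    """Simulate one task replicated on several resources.

    runtimes: time each replica would need on its resource. These values are
    hypothetical; a replication scheduler does not know them in advance.
    Returns (winner_index, completion_time, wasted_time).
    """
    if not runtimes:
        raise ValueError("need at least one replica")
    # The task completes as soon as the fastest replica finishes.
    completion = min(runtimes)
    winner = runtimes.index(completion)
    # Assumption: losing replicas are cancelled when the winner finishes,
    # so each of them has consumed `completion` time units by then.
    wasted = completion * (len(runtimes) - 1)
    return winner, completion, wasted


winner, completion, wasted = replicate([9.0, 4.0, 6.0])
print(winner, completion, wasted)  # 1 4.0 8.0
```

In this toy run the task completes in 4.0 time units without the scheduler knowing any run time beforehand, at the cost of 8.0 time units spent on replicas whose results are discarded. This is the trade-off between efficacy and efficiency that the abstract describes.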
