Contention Awareness And Fault-tolerant Schedulingfor Precedence Constrained Tasks In Heterogeneous Systems

Anne Benoit; Mourad Hakem; Yves Robert

首页> 外文期刊>Parallel Computing >Contention Awareness And Fault-tolerant Schedulingfor Precedence Constrained Tasks In Heterogeneous Systems

【24h】

Contention Awareness And Fault-tolerant Schedulingfor Precedence Constrained Tasks In Heterogeneous Systems

机译：异构系统中优先约束任务的竞争意识和容错调度

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Heterogeneous distributed systems are widely deployed for executing computationally intensive parallel applications with diverse computing needs. Such environments require effective scheduling strategies that take into account both algorithmic and architectural characteristics. Unfortunately, most of the scheduling algorithms developed for such systems rely on a simple platform model where communication contention is not taken into account. In addition, it is generally assumed that processors are completely safe. To schedule precedence graphs in a more realistic framework, we introduce first an efficient fault-tolerant scheduling algorithm that is both contention-aware and capable of supporting an arbitrary number of fail-silent (fail-stop) processor failures. Next, we derive a more complex heuristic that departs from the main principle of the first algorithm. Instead of considering a single task (one with highest priority) and assigning all its replicas to the currently best available resources, we consider a chunk of ready tasks, and assign all their replicas in the same decision making procedure. This leads to a better load balance of processors and communication links. We focus on a bi-criteria approach, where we aim at minimizing the total execution time, or latency, given a fixed number of failures supported in the system. Our algorithms have a low time complexity, and drastically reduce the number of additional communications induced by the replication mechanism. Experimental results fully demonstrate the usefulness of the proposed algorithms, which lead to efficient execution schemes while guaranteeing a prescribed level of fault-tolerance.

机译：异构分布式系统被广泛部署以执行具有各种计算需求的计算密集型并行应用程序。这样的环境需要考虑算法和架构特征的有效调度策略。不幸的是，为这种系统开发的大多数调度算法都依赖于一个简单的平台模型，其中没有考虑通信争用。另外，通常假定处理器是完全安全的。为了在更现实的框架中调度优先级图，我们首先介绍一种高效的容错调度算法，该算法既具有竞争意识，又能够支持任意数量的故障静默（故障停止）处理器故障。接下来，我们推导了一种更复杂的启发式方法，它与第一种算法的主要原理背道而驰。我们不考虑单个任务（具有最高优先级的任务）并将其所有副本分配给当前最佳可用资源，而是考虑大量就绪任务，并在同一决策过程中分配其所有副本。这样可以更好地平衡处理器和通信链路的负载。我们专注于双标准方法，在给定的系统支持的固定故障数量下，我们旨在最大程度地减少总执行时间或延迟。我们的算法具有较低的时间复杂度，并大大减少了由复制机制引起的附加通信的数量。实验结果充分证明了所提出算法的有效性，从而在保证规定的容错水平的同时，提供了有效的执行方案。

著录项

来源
《Parallel Computing》 |2009年第2期|83-108|共26页
作者
Anne Benoit; Mourad Hakem; Yves Robert;
展开▼
作者单位

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
communication contention; fault-tolerant scheduling; heterogeneous systems;

机译：通信争端容错调度异构系统;

相似文献

外文文献
中文文献
专利

1. Contention-aware optimal scheduling of real-time precedence-constrained task graphs on heterogeneous distributed systems [J] . Roy Sanjit Kumar, Devaraj Rajesh, Sarkar Arnab, Journal of systems architecture . 2020,第1期

机译：异构分布式系统上实时优先限制任务图的争用感知最佳调度
2. On The Design Of Communication-aware Fault-tolerant Scheduling Algorithms For Precedence Constrained Tasks In Grid Computing Systems With Dedicated Communication Devices [J] . Qin Zheng, Bharadwaj Veeravalli Journal of Parallel and Distributed Computing . 2009,第3期

机译：专用通信设备网格计算系统中优先约束任务的通信感知容错调度算法设计
3. A fault-tolerant scheduling algorithm based on a multi-objective genetic algorithm for precedence-constrained tasks in real-time heterogeneous distributed systems [J] . Yuanlong C., Dong M.P.L. Journal of computational and theoretical nanoscience . 2013,第5期

机译：基于多目标遗传算法的实时异构分布式系统中优先约束任务的容错调度算法
4. Fault-tolerant scheduling algorithm for precedence constrained tasks in grid computing systems with communication efficiency [C] . Ling Yun, Luo Zhenshan, Ge Yujia The 3rd International Conference on Information Sciences and Interaction Sciences . 2010

机译：具有通信效率的网格计算系统中优先约束任务的容错调度算法
5. Compile-time scheduling of precedence-constrained task graphs onto interconnection-constrained heterogeneous processor architectures [D] . Dandamudi, Siva Kumar V. 1995

机译：将优先级受限的任务图编译时调度到互连受限的异构处理器体系结构上
6. Fault-Tolerant Network-On-Chip Router Architecture Design for Heterogeneous Computing Systems in the Context of Internet of Things [O] . Muhammad Rashid, Naveed Khan Baloch, Muhammad Akmal Shafique, 2020

机译：容错网络上环路路由器架构在内容互联网上的异构计算系统
7. An efficient fault-tolerant scheduling algorithm for precedence constrained tasks in heterogeneous distributed systems [O] . Nakechbandi, Moustafa, Colin, Jean-Yves, Gashumba, Jean Baptiste 2006

机译：异构分布式系统中优先约束任务的高效容错调度算法
8. ROSES: An Efficient Scheduler for Precedence - Constrained Tasks on Concurrent Multiprocessors [R] . Barhen, J., Halbert, E. C. 1985

机译：ROsEs：优先级的高效调度程序 - 并发多处理器上的约束任务

Contention Awareness And Fault-tolerant Schedulingfor Precedence Constrained Tasks In Heterogeneous Systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅