首页> 外文期刊>Parallel Computing >DDS:A deadlock detection-based scheduling algorithm for workflow computations in HPC systems with storage constraints
【24h】

DDS:A deadlock detection-based scheduling algorithm for workflow computations in HPC systems with storage constraints

机译:DDS:基于死锁检测的调度算法,用于具有存储约束的HPC系统中的工作流计算

获取原文
获取原文并翻译 | 示例
           

摘要

Workflow-based workloads usually consist of multiple instances of the same workflow, which are jobs with control or data dependencies, to carry out a well-defined scientific computation task, with each instance acting on its own input data. To maximize throughput performance, a high degree of concurrency is achievable by running multiple instances simultaneously. However, deadlock is a potential problem when storage is constrained. To address this problem, we design and evaluate a deadlock detection-based scheduling (DDS) algorithm that can achieve high performance by making the best use of the available storage resources. Our algorithm takes advantages of the dataflow information of the workflow to speculatively schedule each instance if the instant storage is sufficient for some constituent jobs, but not necessarily for the whole workflow instance. Whenever deadlock or a performance anomaly is detected, some selected in-progress workflow instances are required to be rollbacked to release storage for other blocked jobs. We develop a suite of strategies to select the victims and beneficiaries (instances or jobs) and evaluate their performance via a simulation-based study. Our results show that the DDS algorithm can adapt the job concurrency to the available storage resources and achieve higher performance than some deadlock avoidance methods in our synthetic and real workflow computations.
机译:基于工作流的工作负载通常由同一工作流的多个实例组成,这些实例是具有控制或数据依赖性的作业,以执行定义明确的科学计算任务,每个实例都对自己的输入数据起作用。为了最大化吞吐量性能,可以通过同时运行多个实例来实现高度的并发性。但是,当存储受到限制时,死锁是一个潜在的问题。为了解决此问题,我们设计并评估了基于死锁检测的调度(DDS)算法,该算法可以通过充分利用可用存储资源来实现高性能。如果即时存储足以完成某些组成工作,但不一定需要整个工作流实例,我们的算法将利用工作流的数据流信息来推测性地调度每个实例。每当检测到死锁或性能异常时,都需要回滚某些选定的正在进行中的工作流实例,以释放其他阻塞作业的存储空间。我们开发了一套策略来选择受害者和受益者(实例或工作),并通过基于模拟的研究评估他们的表现。我们的结果表明,在我们的综合和实际工作流计算中,DDS算法可以使作业并发适应可用的存储资源,并比某些避免死锁的方法实现更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号