...
首页> 外文期刊>Journal of Grid Computing >Grid Resource Availability Prediction-Based Scheduling and Task Replication
【24h】

Grid Resource Availability Prediction-Based Scheduling and Task Replication

机译:基于网格资源可用性预测的调度和任务复制

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The frequent and volatile unavailability of volunteer-based Grid computing resources challenges Grid schedulers to make effective job placements. The manner in which host resources become unavailable will have different effects on different jobs, depending on their runtime and their ability to be checkpointed or replicated. A multi-state availability model can help improve scheduling performance by capturing the various ways a resource may be available or unavailable to the Grid. This paper uses a multi-state model and analyzes a machine availability trace in terms of that model. Several prediction techniques then forecast resource transitions into the model’s states. We analyze the accuracy of our predictors, which outperform existing approaches. We also propose and study several classes of schedulers that utilize the predictions, and a method for combining scheduling factors. We characterize the inherent tradeoff between job makespan and the number of evictions due to failure, and demonstrate how our schedulers can navigate this tradeoff under various scenarios. Lastly, we propose job replication techniques, which our schedulers utilize to replicate those jobs that are most likely to fail. Our replication strategies outperform others, as measured by improved makespan and fewer redundant operations. In particular, we define a new metric for replication efficiency, and demonstrate that our multi-state availability predictor can provide information that allows our schedulers to be more efficient than others that blindly replicate all jobs or some static percentage of jobs.
机译:基于志愿者的网格计算资源的频繁和不稳定的可用性挑战了网格调度程序来进行有效的工作安置。主机资源不可用的方式将对不同的作业产生不同的影响,具体取决于它们的运行时以及它们被检查点或复制的能力。多状态可用性模型可以通过捕获网格可能可用或不可用的各种方式来帮助提高调度性能。本文使用多状态模型,并根据该模型分析机器可用性跟踪。然后,几种预测技术可以预测资源向模型状态的过渡。我们分析了预测指标的准确性,该预测指标优于现有方法。我们还提出和研究利用预测的几类调度程序,以及一种组合调度因素的方法。我们描述了作业完成时间和由于故障而驱逐的次数之间固有的权衡,并演示了调度程序如何在各种情况下进行权衡。最后,我们提出了作业复制技术,我们的调度程序利用它来复制那些最有可能失败的作业。我们的复制策略优于其他策略,这可以通过改进的制造时间和更少的冗余操作来衡量。特别是,我们为复制效率定义了一个新的指标,并证明了我们的多状态可用性预测器可以提供使我们的调度程序比盲目地复制所有作业或某些静态作业百分比的调度程序更有效率的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号