首页> 外文OA文献 >Checkpoint and run-time adaptation with pluggable parallelisation
【2h】

Checkpoint and run-time adaptation with pluggable parallelisation

机译:可插拔并行化的检查点和运行时自适应

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Enabling applications for computational Grids requires new approaches to develop applications that can effectively cope with resource volatility. Applications must be resilient to resource faults, adapting the behaviour to available resources. This paper describes an approach to application-level adaptation that efficiently supports application-level checkpointing. The key of this work is the concept of pluggable parallelisation, which localises parallelisation issues into multiple modules that can be (un)plugged to match resource availability. This paper shows how pluggable parallelisation can be extended to effectively support checkpointing and run-time adaptation. We present the developed pluggable mechanism that helps the programmer to include checkpointing in the base (sequential). Based on these mechanisms and on previous work on pluggable parallelisation, our approach is able to automatically add support for checkpointing in parallel execution environments. Moreover, applications can adapt from a sequential execution to a multi-cluster configuration. Adaptation can be performed by checkpointing the application and restarting on a different mode or can be performed during run-time. Pluggable parallelisation intrinsically promotes the separation of software functionality from fault-tolerance and adaptation issues facilitating their analysis and evolution. The work presented in this paper reinforces this idea by showing the feasibility of the approach and performance benefits that can be achieved.
机译:为计算网格启用应用程序需要新方法来开发可有效应对资源波动的应用程序。应用程序必须能够抵抗资源故障,使行为适应可用资源。本文介绍了一种可有效支持应用程序级检查点的应用程序级适应方法。这项工作的关键是可插拔并行化的概念,该概念将并行化问题本地化为多个模块,可以将这些模块(取消插入)以匹配资源可用性。本文展示了如何扩展可插拔并行化以有效支持检查点和运行时自适应。我们提出了已开发的可插拔机制,该机制可帮助程序员在基础(顺序)中包括检查点。基于这些机制以及以前在可插拔并行化方面的工作,我们的方法能够自动添加对并行执行环境中检查点的支持。而且,应用程序可以从顺序执行适应多集群配置。可以通过对应用程序进行检查并在其他模式下重新启动来执行适配,也可以在运行时执行适配。可插拔并行化本质上促进了软件功能与容错和自适应问题的分离,从而促进了它们的分析和发展。本文中的工作通过展示该方法的可行性和可以实现的性能优势,进一步强化了这一思想。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号