首页> 外文会议>IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises >A Delayed Checkpoint Approach for Communication-induced Checkpointing in Autonomic Computing
【24h】

A Delayed Checkpoint Approach for Communication-induced Checkpointing in Autonomic Computing

机译:一种延迟检查点方法,用于沟通诱导的自主计算中的检查点

获取原文

摘要

Although the initiative of Autonomic Computing was introduced a dozen years ago, several challenges remain open. One of these challenges is the efficient monitoring at runtime oriented to the detection, diagnosis, and repair of problems that result from failures or bugs in software and/or hardware components. For this purpose, Communication-induced Checkpointing (CIC) can be a useful tool. Communication-induced Checkpointing has been used to attack a wide range of problems that arise in distributed systems, such as rollback recovery, software debugging and software verification, among others. In CIC algorithms, an autonomic component (process) asynchronously cooperates by exchanging information on the application messages about saved local states called checkpoints. CIC aims to form global consistent snapshots by grouping checkpoints (one by each component) in a non-coordinated way. To achieve this, CIC solutions continuously monitor the exchanged control information to identify possible dangerous checkpointing patterns. When a dangerous pattern is identified, it is broken by locally triggering a forced checkpoint. Nevertheless, as we will show, not all forced checkpoints triggered by current solutions are necessary. In this paper, we present a delayed checkpoint approach suitable for autonomic computing that reduces forced checkpoints by establishing certain triggering rules that we call safe checkpoint conditions. Finally, some results are presented which show that our proposal is more efficient than other current solutions.
机译:虽然自主计算的举措是在十几年前推出,一些挑战仍然开放。这些挑战之一是在运行时有效的监测面向检测,诊断,以及来自于软件和/或硬件组件故障或错误而导致的问题修复。为了这个目的,通信诱导的检查点(CIC)可以是一个有用的工具。通讯引起的检查点已经被用来攻击大范围的分布式系统中,如回滚恢复,软件调试和软件验证,等等出现的问题。在CIC算法,自主成分(处理)通过异步关于保存本地状态称为检查点的应用程序消息交换信息相配合。中投公司的目标,形成由非协调的方式分组检查站(一个由每个组件)的全球一致的快照。为了实现这一目标,中投公司的解决方案持续监控交换的控制信息来识别可能的危险的检查点的图案。当一个危险的模式是确定的,它是由本地触发强制检查点打破。然而,当我们将展示,而不是目前的解决方案触发的所有检查站被迫需要。在本文中,我们提出了适合于减少被迫通过建立一定的触发规则,我们称之为安全检查站检查站的条件自主计算延迟检查站的办法。最后,一些结果呈现这表明,我们的建议是比其他现有解决方案更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号