A Delayed Checkpoint Approach for Communication-induced Checkpointing in Autonomic Computing

机译：一种延迟检查点方法，用于沟通诱导的自主计算中的检查点

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Although the initiative of Autonomic Computing was introduced a dozen years ago, several challenges remain open. One of these challenges is the efficient monitoring at runtime oriented to the detection, diagnosis, and repair of problems that result from failures or bugs in software and/or hardware components. For this purpose, Communication-induced Checkpointing (CIC) can be a useful tool. Communication-induced Checkpointing has been used to attack a wide range of problems that arise in distributed systems, such as rollback recovery, software debugging and software verification, among others. In CIC algorithms, an autonomic component (process) asynchronously cooperates by exchanging information on the application messages about saved local states called checkpoints. CIC aims to form global consistent snapshots by grouping checkpoints (one by each component) in a non-coordinated way. To achieve this, CIC solutions continuously monitor the exchanged control information to identify possible dangerous checkpointing patterns. When a dangerous pattern is identified, it is broken by locally triggering a forced checkpoint. Nevertheless, as we will show, not all forced checkpoints triggered by current solutions are necessary. In this paper, we present a delayed checkpoint approach suitable for autonomic computing that reduces forced checkpoints by establishing certain triggering rules that we call safe checkpoint conditions. Finally, some results are presented which show that our proposal is more efficient than other current solutions.

机译：虽然自主计算的举措是在十几年前推出，一些挑战仍然开放。这些挑战之一是在运行时有效的监测面向检测，诊断，以及来自于软件和/或硬件组件故障或错误而导致的问题修复。为了这个目的，通信诱导的检查点（CIC）可以是一个有用的工具。通讯引起的检查点已经被用来攻击大范围的分布式系统中，如回滚恢复，软件调试和软件验证，等等出现的问题。在CIC算法，自主成分（处理）通过异步关于保存本地状态称为检查点的应用程序消息交换信息相配合。中投公司的目标，形成由非协调的方式分组检查站（一个由每个组件）的全球一致的快照。为了实现这一目标，中投公司的解决方案持续监控交换的控制信息来识别可能的危险的检查点的图案。当一个危险的模式是确定的，它是由本地触发强制检查点打破。然而，当我们将展示，而不是目前的解决方案触发的所有检查站被迫需要。在本文中，我们提出了适合于减少被迫通过建立一定的触发规则，我们称之为安全检查站检查站的条件自主计算延迟检查站的办法。最后，一些结果呈现这表明，我们的建议是比其他现有解决方案更有效。

著录项

来源
《IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises》|2013年||共6页
会议地点
作者
Alberto Calixto Simon; Saul E. Pomares Hernandez; Jose Roberto Perez Cruz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Distributed Systems; Communication-induced checkpointing; Autonomic Computing;

机译：分布式系统;通信引起的检查点;自主计算;

相似文献

外文文献
中文文献
专利

1. Self-healing in autonomic distributed systems based on delayed communication-induced checkpointing [J] . Simón Alberto Calixto, Pomares Hernández Saúl Eduardo Pomares, Pérez-Cruz José Roberto, International journal of autonomous and adaptive communications systems . 2016,第3a4期

机译：基于延迟通信诱发检查点的自主分布式系统中的自我修复
2. Lowoverhead communication-induced checkpointing protocols ensuring rollback-dependency trackability property [J] . Z. Abdelhafidi, N. Lagraa, M. B. Yagoubi, Concurrency, practice and experience . 2017,第21期

机译：低开销通信引起的检查点协议，确保回滚相关性可跟踪性
3. A Scalable Communication-Induced Checkpointing Algorithm for Distributed Systems [J] . Alberto CALIXTO SIMON, Saul E. POMARES HERNANDEZ, Jose Roberto PEREZ CRUZ, IEICE transactions on information and systems . 2013,第4期

机译：分布式系统的可扩展通信诱导检查点算法
4. A Delayed Checkpoint Approach for Communication-Induced Checkpointing in Autonomic Computing [C] . Simon Alberto Calixto, Hernandez Saul E.Pomares, Cruz Jose Roberto Perez 2013 IEEE 22nd International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises . 2013

机译：自主计算中通信诱导检查点的延迟检查点方法
5. Communication-Induced Checkpointing and Recovery Protocols for Distributed Systems [D] . Luo, Yi 2011

机译：分布式系统的通信诱导检查点和恢复协议
6. Chromosomes with delayed replication timing lead to checkpoint activation delayed recruitment of Aurora B and chromosome instability [O] . BH Chang, L Smith, J Huang, -1

机译：复制时间延迟的染色体会导致检查点激活Aurora B的募集延迟和染色体不稳定
7. Brief Announcement: Mutable Checkpoints: A New Checkpointing Approach for Mobile Computing Systems [O] . Guohong Cao, Mukesh Singhal 2014

机译：简要说明：可变检查点：移动计算系统的一种新的检查点方法

A Delayed Checkpoint Approach for Communication-induced Checkpointing in Autonomic Computing

摘要

著录项

相似文献

相关主题

期刊订阅