IEEE Transactions on Services Computing

Toward a Smart Cloud: A Review of Fault-Tolerance Methods in Cloud Systems


Abstract

This paper presents a comprehensive survey of the state-of-the-art work on fault-tolerance methods proposed for cloud computing. The survey classifies fault-tolerance methods into three categories: 1) ReActive Methods (RAMs); 2) PRoactive Methods (PRMs); and 3) ReSilient Methods (RSMs). RAMs allow the system to enter a fault status and then try to recover the system. PRMs tend to prevent the system from entering a fault status by implementing mechanisms that enable them to avoid errors before they affect the system. On the other hand, recently emerging RSMs aim to minimize the amount of time it takes for a system to recover from a fault. Machine Learning and Artificial Intelligence have played an active role in the RSM domain in such a way that the recovery time is mapped to a function to be optimized (i.e., by converging the recovery time to a fraction of a millisecond). As the system learns to deal with new faults, the recovery time will become shorter. In addition, current issues and challenges in cloud fault tolerance are also discussed to identify promising areas for future research.
