首页> 外文会议>2011 Sixth International Conference on Availability, Reliability and Security >Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems
【24h】

Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems

机译:通过集成无监督和半监督学习对可靠云系统进行主动故障管理

获取原文

摘要

Cloud computing systems continue to grow in their scale and complexity. They are changing dynamically as well due to the addition and removal of system components, changing execution environments, frequent updates and upgrades, online repairs and more. In such large-scale complex and dynamic systems, failures are common. In this paper, we present a failure prediction mechanism exploiting both unsupervised and semi-supervised learning techniques for building dependable cloud computing systems. The unsupervised failure detection method uses an ensemble of Bayesian models. It characterizes normal execution states of the system and detects anomalous behaviors. After the anomalies are verified by system administrators, labeled data are available. Then, we apply supervised learning based on decision tree classier to predict future failure occurrences in the cloud. Experimental results in an institute-wide cloud computing system show that our proposed method can forecast failure dynamics with high accuracy.
机译:云计算系统的规模和复杂性不断增长。由于添加和删除了系统组件,更改了执行环境,频繁的更新和升级,在线维修等等,它们也在动态变化。在这样的大型复杂而动态的系统中,故障很常见。在本文中,我们提出了一种故障预测机制,该机制利用无监督和半监督学习技术来构建可靠的云计算系统。无监督故障检测方法使用贝叶斯模型的集成。它表征系统的正常执行状态并检测异常行为。由系统管理员验证异常之后,可以使用带标签的数据。然后,我们基于决策树分类器应用监督学习来预测云中将来发生的故障。在整个研究所的云计算系统中的实验结果表明,我们提出的方法可以高精度地预测故障动态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号