首页> 外文会议>IEEE international conference on computer science and information technology >A Failure Detection and Prediction Mechanism for Enhancing Dependability of Data Centers
【24h】

A Failure Detection and Prediction Mechanism for Enhancing Dependability of Data Centers

机译:增强数据中心可靠性的故障检测与预测机制

获取原文

摘要

Modern data centers continue to grow in their scale and complexity. They are changing dynamically as well due to the addition and removal of system components, changing execution environments, frequent updates and upgrades, online repairs and more. Classical reliability theory and conventional methods do rarely consider the actual slate of a system and are therefore not capable to reflect the dynamics of runtime systems and failure processes. In this paper, we present an unsupervised failure detection and prediction method using an ensemble of Bayesian models. It characterizes normal execution states of the system and detects anomalous behaviors. We implement a prototype of our failure detection and prediction mechanism and evaluate its performance on a data center test platform. Experimental results show that our proposed method can forecast failure dynamics with high accuracy.
机译:现代数据中心的规模和复杂性不断增长。由于添加和删除了系统组件,更改了执行环境,频繁的更新和升级,在线维修等等,它们也在动态变化。经典的可靠性理论和常规方法很少考虑系统的实际情况,因此无法反映运行时系统和故障过程的动态。在本文中,我们提出了一种使用贝叶斯模型集成的无监督故障检测和预测方法。它表征系统的正常执行状态并检测异常行为。我们实施故障检测和预测机制的原型,并在数据中心测试平台上评估其性能。实验结果表明,本文提出的方法可以准确预测故障动态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号