首页> 外文会议>IEEE/IFIP International Conference on Dependable Systems Networks >Automatic fault detection and diagnosis in complex software systems by information-theoretic monitoring
【24h】

Automatic fault detection and diagnosis in complex software systems by information-theoretic monitoring

机译:通过信息理论监测自动故障检测和复杂软件系统的诊断

获取原文

摘要

Management metrics of complex software systems exhibit stable correlations which can enable fault detection and diagnosis. Current approaches use specific analytic forms, typically linear, for modeling correlations. In this paper we use Normalized Mutual Information as a similarity measure to identify clusters of correlated metrics, without knowing the specific form. We show how we can apply the Wilcoxon Rank-Sum test to identify anomalous behaviour. We present two diagnosis algorithms to locate faulty components: RatioScore, based on the Jaccard Coefficient, and SigScore, which incorporates knowledge of component dependencies. We evaluate our mechanisms in the context of a complex enterprise application. Through fault-injection experiments, we show that we can detect 17 out of 22 faults without any false positives. We diagnose the faulty component in the top five anomaly scores 7 times out of 17 using SigScore, which is 40% better than when system structure is ignored.
机译:复杂软件系统的管理指标表现出稳定的相关性,可以实现故障检测和诊断。目前的方法使用特定的分析形式,通常是线性的,以进行建模相关性。在本文中,我们使用标准化的互信息作为相似度量来识别相关度量集群,而不知道特定形式。我们展示了如何应用Wilcoxon秩和测试来识别异常行为。我们提出了两个诊断算法来定位故障组件:比率基于Jaccard系数和SigScore,它包含了组件依赖性的知识。我们在复杂的企业应用程序的背景下评估我们的机制。通过故障注入实验,我们表明我们可以在没有任何误报的情况下检测到22个故障中的17个。我们使用SIGSCORE诊断前五个异常分数的错误组分7次,比忽略系统结构的时间好40%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号