首页> 外文会议>IEEE International Conference on Data Engineering >Automated Diagnosis of System Failures with Fa
【24h】

Automated Diagnosis of System Failures with Fa

机译:使用FA自动诊断系统故障

获取原文

摘要

Failures of Internet services and enterprise systems lead to user dissatisfaction and considerable loss of revenue. Since manual diagnosis is often laborious and slow, there is considerable interest in tools that can diagnose the cause of failures quickly and automatically from system-monitoring data. Fa uses monitoring data to construct a database of {em failure signatures} against which data from undiagnosed failures can be matched. Two novel challenges we address are to make signatures robust to the noisy monitoring data in production systems, and to generate reliable confidence estimates for matches. Fa uses a new technique called {em anomaly-based clustering} when thesignature database has no high-confidence match for an undiagnosed failure. This technique clusters monitoring data based on how it differs from the failure data, and pinpoints attributes linked to the failure. We show the effectiveness of Fa through a comprehensive experimental evaluation based on failures from a production setting, a variety of failures injected in a testbed, and synthetic data.
机译:互联网服务和企业系统的失败导致用户不满和大幅损失。由于手动诊断往往是费力和缓慢,因此对可以快速并自动地从系统监控数据诊断故障原因的工具有相当大的兴趣。 FA使用监视数据构造{EM失败签名}的数据库,从而可以匹配来自未确认的失败的数据。我们地址的两项新颖挑战是使签名对生产系统中的嘈杂监测数据进行强大,并为匹配产生可靠的信心估计。 FA使用称为{EM异常的聚类}的新技术}当Ingnatureature数据库没有高频率匹配的未确诊失败时。此技术基于与故障数据的不同方式监视数据,并针对与故障相关的属性。我们通过基于生产环境的故障的故障显示FA的综合实验评估,在测试平台和合成数据中注入的各种故障,展示了FA的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号