首页> 外文期刊>Emerging Topics in Computing, IEEE Transactions on >InvarNet-X: A Black-Box Invariant-Based Approach to Diagnosing Big Data Systems
【24h】

InvarNet-X: A Black-Box Invariant-Based Approach to Diagnosing Big Data Systems

机译: InvarNet-X :一种基于黑箱不变式的大数据系统诊断方法

获取原文
获取原文并翻译 | 示例
       

摘要

As big data spreads rapidly, performance problems in these systems become common concerns. As the first line of defending these problems, performance diagnosis plays an essential role in big data systems. It is notoriously difficult to conduct performance diagnosis in large distributed systems. Previous work either pinpoint the root causes by instrumenting the applications or runtime systems in a white-box way, which leads to a considerable overhead, or just provide some hints to the hidden root causes in a black-box way. Very few works focus on pinpointing the real root causes in a black-box way. To address this problem, this paper proposes a black-box invariant-based diagnosing approach and implements a proof-of-concept system named InvarNet-X. In this paper, performance diagnosis is formalized as a pattern recognition problem, meaning that each performance problem is identified by a specific pattern. The rationale of InvarNet-X is that the unobservable root causes of performance problems always expose themselves through the violations of the associations among directly observable performance metrics. Such observable associations are called likely invariants calculated by the maximal information criterion, and each performance problem is signified by a sparse distributed representation. A problem signature database is constructed by training multiple real performance problems in advance. Once a performance anomaly is detected, the diagnosing procedure is triggered. The root cause is pinpointed by retrieving similar signatures in the signature database. The experimental evaluations in a controlled big data system show that InvarNet-X can achieve a high accuracy in diagnosing some real performance problems reported in software bug repositories, which is superior to several state-of-the-art approaches. Moreover, the light-weight property makes InvarNet-X easily facilitated in large-scale big data systems in real time.
机译:随着大数据的迅速传播,这些系统中的性能问题已成为普遍关注的问题。作为防御这些问题的第一线,性能诊断在大数据系统中起着至关重要的作用。众所周知,在大型分布式系统中进行性能诊断非常困难。先前的工作或者通过以白盒方式对应用程序或运行时系统进行检测来查明根本原因,这会导致相当大的开销,或者只是以黑盒方式为隐藏的根本原因提供了一些提示。很少有作品会以黑匣子的方式来找出真正的根本原因。为了解决这个问题,本文提出了一种基于黑盒不变性的诊断方法,并实现了名为InvarNet-X的概念验证系统。在本文中,性能诊断被形式化为模式识别问题,这意味着每个性能问题都由特定的模式识别。 InvarNet-X的基本原理是,性能问题的不可观察的根本原因总是通过违反直接可观察的性能指标之间的关联而暴露出来。此类可观察的关联称为通过最大信息标准计算的可能不变量,并且每个性能问题都由稀疏的分布表示表示。通过预先训练多个实际性能问题来构建问题签名数据库。一旦检测到性能异常,就会触发诊断过程。根本原因是通过在签名数据库中检索相似的签名来确定的。在受控的大数据系统中进行的实验评估表明,InvarNet-X在诊断软件错误存储库中报告的某些实际性能问题方面可以达到较高的准确性,它优于几种最新方法。而且,轻量级的特性使得InvarNet-X可以轻松地在大型实时大数据系统中实时实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号