首页> 外文会议>IEEE International Symposium on Service-Oriented System Engineering >An Ensemble Signature-Based Approach for Performance Diagnosis in Big Data Platform
【24h】

An Ensemble Signature-Based Approach for Performance Diagnosis in Big Data Platform

机译:基于集成签名的大数据平台性能诊断方法

获取原文

摘要

The big data platform always suffers from performance problems due to internal impairments (e.g. software bugs) and external impairments (e.g. resource hog). And the situation is exacerbated by the properties of velocity, variety and volume (3Vs) of big data. To recovery the system from performance anomaly, the first step is to find the root causes. In this paper, we propose a novel signature-based performance diagnosis approach to rapidly pinpoint the root causes of performance problems in big data platforms. The performance diagnosis is formalized as a pattern recognition problem. We leverage Maximum Information Criterion (MIC) to express the invariant relationships amongst the performance metrics in the normal state. Each performance problem occurred in the big data platform is signified by a unique binary vector named signature, which consists of a set of violations of MIC invariants. The signatures of multiple performance problems form a signature database. If the Key Performance Indicator (KPI) of the big data application exhibits model drift, our approach can identify the real culprits by retrieving the root causes which have similar signatures to the current performance problem. Moreover, considering the diversity of big data applications, we establish an ensemble approach to treat each application separately. The experiment evaluations in a controlled big data platform show that our approach can pinpoint the real culprits of performance problems in an average 84% precision and 87% recall when one fault occurs, which is better than several state-of-the-art approaches.
机译:大数据平台始终会由于内部缺陷(例如软件错误)和外部缺陷(例如资源浪费)而遭受性能问题的困扰。大数据的速度,多样性和体积(3Vs)的特性使情况更加恶化。要从性能异常中恢复系统,第一步是找到根本原因。在本文中,我们提出了一种新颖的基于签名的性能诊断方法,以快速查明大数据平台中性能问题的根本原因。性能诊断被形式化为模式识别问题。我们利用最大信息标准(MIC)来表示正常状态下性能指标之间的不变关系。大数据平台中发生的每个性能问题都由一个称为签名的唯一二进制向量表示,该向量由一组违反MIC不变量组成。多个性能问题的签名形成一个签名数据库。如果大数据应用程序的关键绩效指标(KPI)出现模型漂移,则我们的方法可以通过检索与当前性能问题具有相似特征的根本原因来识别真正的罪魁祸首。此外,考虑到大数据应用程序的多样性,我们建立了整体方法来分别处理每个应用程序。在可控的大数据平台上进行的实验评估表明,当发生一个故障时,我们的方法可以以84%的平均精度和87%的召回率查明性能问题的真正原因,这比几种最新技术要好。方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号