首页> 外文会议>IEEE International Conference on Cloud Computing >Automated Enforcement of SLA for Cloud Services
【24h】

Automated Enforcement of SLA for Cloud Services

机译:自动执行SLA for Cloud Services

获取原文

摘要

Orchestration and management of cloud computing entities necessitate measuring and analysis of real-time monitored performance metrics. However, decision making in current management platforms are addressed separately in different cloud stack layers. These isolated active management decisions may degrade the total performance of the cloud system. Since, cloud computing platforms lack an integrated analytics and management capability, in this paper, we propose an integrated platform to detect and predict situations where corrective actions are required. First, a Dynamic Bayesian Network (DBN) is trained and updated by collected data to calculate the causal dependencies among various entities in different cloud service layers. The correlation values are then fed into a Long Short-Term Memory (LSTM) neural network to predict the future states. States that violate the Service Level Agreement(SLA) of cloud services are learned with training data, and if the forecasted states threaten the SLA of cloud services, associated events are generated to trigger management actions. Next, management actions are assigned a different set of events using a reinforcement learning approach. A set of experiments based on collected data from a real cloud service environment is conducted to validate the proposed approach. Experimental results indicate that the proposed method outperforms the current management solutions and improves web request response time by up to 7% and decreases SLA violation by 79% in the context of web application auto-scaling.
机译:云计算实体的编排和管理需要对实时监控的性能指标进行测量和分析。但是,当前管理平台中的决策是在不同的云堆栈层中分别处理的。这些孤立的主动管理决策可能会降低云系统的总体性能。由于云计算平台缺乏集成的分析和管理功能,因此在本文中,我们提出了一个集成的平台来检测和预测需要采取纠正措施的情况。首先,通过收集的数据对动态贝叶斯网络(DBN)进行培训和更新,以计算不同云服务层中各个实体之间的因果关系。然后将相关值输入到长期短期记忆(LSTM)神经网络中,以预测将来的状态。可以从培训数据中了解违反云服务的服务水平协议(SLA)的州,如果预测的州威胁到云服务的SLA,则会生成关联的事件以触发管理操作。接下来,使用强化学习方法为管理操作分配不同的事件集。基于从真实云服务环境中收集的数据进行了一组实验,以验证所提出的方法。实验结果表明,在Web应用程序自动缩放的情况下,该方法的性能优于当前的管理解决方案,将Web请求的响应时间缩短了7%,并减少了79%的SLA违规。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号