首页> 外文会议>IEEE International Conference on Cloud Computing >Performance Interference-Aware Vertical Elasticity for Cloud-Hosted Latency-Sensitive Applications
【24h】

Performance Interference-Aware Vertical Elasticity for Cloud-Hosted Latency-Sensitive Applications

机译:云托管延迟敏感应用程序的性能感知感知垂直弹性

获取原文

摘要

Elastic auto-scaling in cloud platforms has primarily used horizontal scaling by assigning application instances to distributed resources. Owing to rapid advances in hardware, cloud providers are now seeking vertical elasticity before attempting horizontal scaling to provide elastic auto-scaling for applications. Vertical elasticity solutions must, however, be cognizant of performance interference that stems from multi-tenant collocated applications since interference significantly impacts application quality-of-service (QoS) properties, such as latency. The problem becomes more pronounced for latency-sensitive applications that demand strict QoS properties. Further exacerbating the problem are variations in workloads, which make it hard to determine the right kinds of timely resource adaptations for latency-sensitive applications. To address these challenges and overcome limitations in existing offline approaches, we present an online, data-driven approach which utilizes Gaussian Processes-based machine learning techniques to build runtime predictive models of the performance of the system under different levels of interference. The predictive online models are then used in dynamically adapting to the workload variability by vertically auto-scaling co-located applications such that performance interference is minimized and QoS properties of latency-sensitive applications are met.
机译:云平台中的弹性自动扩展主要通过将应用程序实例分配给分布式资源来使用水平扩展。由于硬件的飞速发展,云提供商现在在尝试水平缩放以为应用程序提供弹性自动缩放之前寻求垂直弹性。但是,垂直弹性解决方案必须意识到多租户并置应用程序引起的性能干扰,因为干扰会严重影响应用程序的服务质量(QoS)属性,例如延迟。对于需要严格的QoS属性的对延迟敏感的应用程序,此问题变得更加突出。工作负载的变化进一步加剧了该问题,这使得难以为对延迟敏感的应用程序确定正确的及时资源适配类型。为了解决这些挑战并克服现有脱机方法的局限性,我们提出了一种在线的,数据驱动的方法,该方法利用基于高斯过程的机器学习技术来构建在不同干扰水平下系统性能的运行时预测模型。然后,将预测性在线模型用于通过垂直自动缩放并置应用程序来动态适应工作负载可变性,从而最大程度地降低性能干扰并满足对延迟敏感的应用程序的QoS属性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号