首页> 外文会议>International Symposium on Network Computing and Applications >A Dynamic Resource Controller for Resolving Quality of Service Issues in Modern Streaming Processing Engines
【24h】

A Dynamic Resource Controller for Resolving Quality of Service Issues in Modern Streaming Processing Engines

机译:用于解决现代流处理引擎中服务质量问题的动态资源控制器

获取原文

摘要

Devising an elastic resource allocation controller of data analytical applications in virtualized data-center has received a great attention recently, mainly due to the fact that even a slight performance improvement can translate to huge monetary savings in practical large-scale execution. Apache Flink is among modern streamed data processing run-times that can provide both low latency and high throughput computation in to execute processing pipelines over high-volume and high-velocity data-items under tight latency constraints. However, a yet to be answered challenge in a large-scale platform with tens of worker nodes is how to resolve the run-time violation in the quality of service (QoS) level in a multi-tenant data streaming platforms, particularly when the amount of workload generated by different users fluctuates. Studies showed that a static resource allocation algorithm (round-robin), which is used by default in Apache Flink, suffer from lack of responsiveness to sudden traffic surges happening unpredictably during the run-time. In this paper, we address the problem of resource management in a Flink platform for ensuring different QoS enforcement levels in a platform with shared computing resources. The proposed solution applies theoretical principals borrowed from close-loop control theory to design a CPU and memory adjustment mechanism with the primary goal to fulfill the different QoS levels requested by submitted applications while the resource interference is considered as the critical performance-limiting factor. The performance evaluation is carried out by comparing the proposed resource allocation mechanism with two static heuristics (round robin and class-based weighted fair queuing) in a 80-core cluster under multiple traffic patterns resembling sudden changes in the incoming workloads of low-priory streaming applications. The experimental results confirm the stability of the proposed controller to regulate the underlying platform resources to smoothly follow the target values (QoS violation rates). Particularly, the proposed solution can achieve higher efficiency compared to the other heuristics by reducing the response-time of high priority applications by 53% while maintaining the enforced QoS levels during the burst traffic periods.
机译:制定数据分析应用程序的弹性资源分配控制器的虚拟化数据中心已经获得了极大的关注最近,这主要是由于这样的事实,即使是轻微的性能提升可以转化为巨大的资金节约在实际的大规模执行。阿帕奇弗林克是现代流数据处理的运行时间,可以提供低延迟和高吞吐量计算在执行下严格的延时约束了大容量和高速数据项处理管道中。然而,在一个大型平台,数万工人节点的尚未回答的挑战是如何解决运行时违反服务质量(QoS)水平在一个多租户数据流的平台,特别是在量工作量的由不同的用户发生变动产生。表明,静态资源分配算法(循环),这是默认使用了Apache弗林克,缺乏响应的突发流量遭受浪涌的研究在运行时不可预知的发生。在本文中,我们解决一个弗林克平台,为了保证与共享计算资源在一个平台上不同的QoS级别的执法资源管理的问题。所提出的解决方案适用于从闭环控制理论借用来设计CPU和内存调节机制的主要目标履行提交申请,要求不同的QoS级别,而资源的干扰被认为是关键的性能限制因素理论的校长。绩效考核是通过在多个业务模式类似的低修道院流的输入工作量的突然变化在80核集群两个静态启发式(轮循和基于类的加权公平队列)所提出的资源分配机制的比较进行应用。实验结果验证了该控制器的稳定性,规范基础平台资源平滑地跟随目标值(QoS的违规率)。特别地,提出的解决方案相比,可以通过由53%减少的高优先级的应用程序的响应时间,而在突发业务量期间保持执行QoS级别的其他启发式实现更高的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号