首页> 外文会议>IEEE International Conference on Network Protocols >Maximizing container-based network isolation in parallel computing clusters
【24h】

Maximizing container-based network isolation in parallel computing clusters

机译:最大化并行计算集群中基于容器的网络隔离

获取原文

摘要

Data-parallel applications, especially those associated with user-facing web services, have struggled to enhance their worst case performance. It is therefore important to improve the minimum amount of resources guaranteed for applications in a cluster. Existing cluster management frameworks, however, provide isolation for computation resources (such as CPU) only, and are oblivious to network isolation guarantees. In this paper, we design, implement and evaluate Libra, a new cluster management framework that helps to maximize the isolation guarantee for the bandwidth requirements from applications. We start with a theoretical analysis of the network sharing problem, which contains two key steps: container placement and bandwidth allocation. By collecting the status of access links and the bandwidth demand of applications, we coordinate the placement of containers to minimize the system bottleneck such that the bandwidth guarantee for applications can be optimized. We further embrace host-based rate limiting to ensure such maximized bandwidth guarantee can be reached without hurting network utilization. Both our testbed-based experiments and large-scale simulations demonstrate that Libra significantly improves the network isolation guarantee: in comparison with existing cluster managers and network schedulers, the performance gain is more than 105.59%. Meanwhile, it improves application performance by 57.71% and maintains high network utilization.
机译:数据并行应用程序,尤其是那些与面向用户的Web服务相关的应用程序,一直在努力提高其最坏情况的性能。因此,重要的是提高为群集中的应用程序保证的最少资源量。但是,现有的群集管理框架仅为计算资源(例如CPU)提供隔离,并且没有网络隔离保证。在本文中,我们设计,实施和评估Libra,这是一种新的群集管理框架,有助于最大程度地隔离应用程序对带宽要求的隔离保证。我们从对网络共享问题的理论分析开始,其中包括两个关键步骤:容器放置和带宽分配。通过收集访问链接的状态和应用程序的带宽需求,我们协调容器的放置以最大程度地减少系统瓶颈,从而可以优化应用程序的带宽保证。我们进一步采用基于主机的速率限制,以确保在不损害网络利用率的情况下达到这种最大化的带宽保证。我们基于测试平台的实验和大规模仿真均表明,Libra大大改善了网络隔离保证:与现有的群集管理器和网络调度程序相比,性能提高了105.59%以上。同时,它将应用程序性能提高57.71%,并保持较高的网络利用率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号