【24h】

Gaia Scheduler: A Kubernetes-Based Scheduler Framework

机译:Gaia Scheduler:基于Kubernetes的Scheduler框架

获取原文
获取原文并翻译 | 示例

摘要

This paper proposed a topology-based GPU scheduling framework. The framework is based on the traditional kubernetes GPU scheduling algorithm. In existing algorithms, GPU can only be completely allocated (In other words, a GPU is the smallest unit of resource allocation), the GPU resources are not fully used and the load is uneven. In our algorithm, GPU cluster topology is restored in a GPU cluster resource access cost tree, and different GPU resource application scenarios are scheduled and dynamically adjusted based on the resource access cost tree to obtain an optimal scheduling effect. The kubernetes GPU cluster load is effectively improved. Experimental results show that performance on load balance and resource utilization are also improved in this way. GaiaGPU has been widely used in the production practice of Tencent, the application of GaiaGPU has increased the resource utilization of GPU cluster by about 10%.
机译:本文提出了一种基于拓扑的GPU调度框架。该框架基于传统的kubernetes GPU调度算法。在现有算法中,只能完全分配GPU(换句话说,GPU是资源分配的最小单位),GPU资源没有得到充分利用,负载不均。在我们的算法中,将GPU集群拓扑结构还原到GPU集群资源访问成本树中,并根据资源访问成本树对不同的GPU资源应用场景进行调度和动态调整,以获得最佳的调度效果。 kubernetes GPU集群负载得到有效改善。实验结果表明,以这种方式还可以改善负载平衡和资源利用方面的性能。 GaiaGPU已在腾讯的生产实践中广泛使用,GaiaGPU的应用使GPU集群的资源利用率提高了约10%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号