首页> 外文会议>IEEE International Conference on Cluster Computing >Co-scheML: Interference-aware Container Co-scheduling Scheme Using Machine Learning Application Profiles for GPU Clusters
【24h】

Co-scheML: Interference-aware Container Co-scheduling Scheme Using Machine Learning Application Profiles for GPU Clusters

机译:Co-scheML:使用针对GPU集群的机器学习应用程序配置文件的可感知干扰的容器协同调度方案

获取原文

摘要

Recently, efficient execution of applications on Graphic Processing Unit(GPU) has emerged as a research topic to increase overall system throughput in cluster environment. As a current cluster orchestration platform using GPUs only supports an exclusive execution of an application on a GPU, the platform may not utilize resource of GPUs fully relying on application characteristics. Nonetheless, co-execution of GPU applications leads to interference coming from resource contention among applications. If diverse resource usage characteristics of GPU applications are not deliberated, unbalanced usage of computing resources and performance degradation could be induced in a GPU cluster. This study introduces Co-scheML for co-execution of various GPU applications such as High Performance Computing (HPC), Deep Learning (DL) Training, and DL Inference. Interference model is constructed by applying Machine Learning (ML) model with GPU metrics since predicting interference has a difficulty. Predicted interference is utilized and deployment of an application is determined by Co-scheML scheduler. Experimental results of the Co-ScheML strategy show that average job completion time is improved by 23%, and the makespan is shortened by 22% in average, as compared to baseline schedulers.
机译:最近,有效地在图形处理单元(GPU)上的应用程序被出现为一个研究主题,以提高集群环境中的整体系统吞吐量。作为使用GPU的当前群集编排平台仅支持GPU上的应用程序的独占执行,该平台可能无法利用GPU的资源完全依赖于应用特征。尽管如此,GPU应用程序的共同执行导致来自应用之间的资源争用的干扰。如果GPU应用程序的各种资源使用特性不刻意,则可以在GPU集群中引起计算资源和性能下降的不平衡使用。本研究介绍了共同执行用于共同执行各种GPU应用,例如高性能计算(HPC),深度学习(DL)训练和DL推断。通过使用GPU度量的机器学习(ML)模型来构造干扰模型,因为预测干扰具有难度。利用预测干扰,并通过Co-Scheml调度程序确定应用程序的部署。与基线调度率相比,平均工作完成时间提高了23%,平均工作完成时间提高了23%,平均缩短了22%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号