首页> 外文会议>International Conference of Emerging Applications of Information Technology >Learning Based Performance and Power Efficient Cluster Resource Manager for CPU-GPU Cluster
【24h】

Learning Based Performance and Power Efficient Cluster Resource Manager for CPU-GPU Cluster

机译:基于学习CPU-GPU群集的性能和功率高效群集资源管理器

获取原文

摘要

The recent success in building petascale High Performance Computing (HPC) systems have produced the demand for efficient and optimized use of resources to increase the performance and reduce the power consumption. Including the above, the heterogeneous architectures of nowadays HPCs comprising a multicore CPU and many-core Accelerator like GPU(s) are facing another concern for using optimum utilization of each of these components. This paper presents the scheduling mechanism of the Cluster Resource Manager (CRM): i. Moldable job Scheduler (MS) which is able to mold the jobs with respect to the number of machines based on an preliminary initialized and auto updated heuristic knowledge-base of problem size, optimum machine count, execution duration to increase the utilization of the full cluster facility. ii) Collocation Aware and Power Efficient Resource Manager (CAPE-RM) manages collocation of CPU only and GPU accelerated jobs by monitoring the CPU load and memory usage. The emerging computation ability is followed by the huge amount of power consumption. Though the use of GPU(s) itself cut down the power to be needed by the only CPU based cluster but to make a green computing facility more power efficiency is desired. The CAPE-RM is designed to support the above by powering off the idle nodes by monitoring the total load to the facility and based on a simple statistic of the frequency of job submission.
机译:最近在建设芭光板高性能计算(HPC)系统方面的成功已经生产了对高效和优化的资源使用的需求,以提高性能并降低功耗。包括以上所示,当前HPC的异构架构包括多核CPU和GPU等许多核心加速器,如GPU所面临的另一个问题,用于使用这些组件中的每一个的最佳利用。本文介绍了群集资源管理器(CRM)的调度机制:i。可模塑作业调度程序(MS)能够基于初始初始化和自动更新的机器型启发式知识库,最佳机器计数,执行持续时间来利用机器的数量来模制作业,以增加完整群集的利用率设施。 ii)焊接感知和功率高效资源管理器(Cape-RM)管理仅限CPU和GPU加速作业通过监控CPU负载和内存使用情况来管理CPU和GPU加速作业。新兴的计算能力之后是大量的功耗。虽然使用GPU本身剪切唯一基于CPU的集群需要的功率,但要使绿色计算设施需要更多功率效率。 CAPE-RM旨在通过监视设施的总负载,并基于作业提交频率的简单统计来支持上述空闲节点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号