首页> 外文会议>International Conference on Communication Systems and Network Technologies >An Autonomic Capacity Management Approach with Cloud Insight towards Cost-Efficient Throughput Optimization for High Performance Computing
【24h】

An Autonomic Capacity Management Approach with Cloud Insight towards Cost-Efficient Throughput Optimization for High Performance Computing

机译:一种自主能力管理方法,云洞察成本高效吞吐量优化高性能计算

获取原文

摘要

High Performance Computing (HPC) leverages cluster combined with a set of computing nodes exploiting computational capacities to handle varying job submission for scientific computing work. Inappropriate capacity planning and related management mechanism applied, will lead HPC cluster into a rather large number of pending jobs which is considered as a critical factor to affect System's throughput against the goal of HPC cluster. Moreover it will result in inefficiency and wasted capacities cost. Thus an autonomic capacity management approach is therefore proposed in this paper, in order to overcome such issues as regards. Firstly we survey recent researches related in deep, and find that they all lack of consideration on computing node's personality which is crucial to solve job submission and is probable to lead submitted jobs into pending in case of there are insufficient computing nodes associated to this personality. Afterward we present our measures focused on autonomic capacity management by taking advantage of Cloud insight to provision capacities dynamically on demand. Such measures are capable of selfadaptively adjusting cluster capacities to form different personalities for varying job submission, by the repurposeability to coordinate capacities from the idle personality with lower running jobs to the other with higher demand on pending jobs. Finally we verify that, proposals in this paper are significantly in achieving optimized throughput by reducing the number of pending jobs in cost-efficiency, with a couple of simulations.
机译:高性能计算(HPC)利用集群结合一组计算节点利用计算能力来处理改变的科学计算工作的工作提交。应用不适当的容量规划和相关管理机制,将使HPC集群引入相当大量的待处理作业,被认为是影响系统吞吐量的吞吐量的关键因素。此外,它将导致低效率和浪费的能力成本。因此,本文提出了自主能力管理方法,以克服此类问题。首先,我们近期调查了近期相关的研究,发现他们对计算节点的个性缺乏考虑,这对解决工作提交至关重要,并且可能导致提交的工作,以便在与这个人格相关的计算节点不足。之后,我们展示了我们的措施,专注于自主能力管理通过利用云洞察力,以便按需动态地提供能力。这种措施能够自私地调整集群能力,以形成不同的个性,以改变就业提交,通过重新核肉性,以协调怠速个性的能力,以较低的运行就业机会对等待作业的需求较高。最后,我们通过减少成本效率的成本效率的数量来验证,本文的提案在实现优化的吞吐量方面,有几个模拟。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号