首页> 外国专利> DYNAMICALLY REASSIGNING A CONNECTED NODE TO A BLOCK OF COMPUTE NODES FOR RE-LAUNCHING A FAILED JOB

DYNAMICALLY REASSIGNING A CONNECTED NODE TO A BLOCK OF COMPUTE NODES FOR RE-LAUNCHING A FAILED JOB

机译:动态地将连接的节点重新分配到计算机节点块以重新启动失败的作业

摘要

Methods, systems, and products for dynamically reassigning a connected node to a block of compute nodes for re-launching a failed job that include: identifying that a job failed to execute on the block of compute nodes because connectivity failed between a compute node assigned as at least one of the connected nodes for the block of compute nodes and its supporting I/O node; and re-launching the job, including selecting an alternative connected node that is actively coupled for data communications with an active I/O node; and assigning the alternative connected node as the connected node for the block of compute nodes running the re-launched job.
机译:用于将连接的节点动态地重新分配给计算节点块以重新启动失败的作业的方法,系统和产品,包括:识别作业未能在计算节点块上执行,因为分配为的计算节点之间的连接失败计算节点块及其支持的I / O节点的至少一个连接节点;重新启动该作业,包括选择与活动的I / O节点进行主动耦合以进行数据通信的备用连接节点;并将替代连接节点分配为运行重新启动的作业的计算节点块的连接节点。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号