System and method of active risk management to reduce job de-scheduling probability in computer clusters
Abstract
Systems and methods are provided for generating backup tasks (138) for a plurality of tasks (120) scheduled to run in a computer cluster (100). Each scheduled task is associated with a target probability for execution and is executable by a first cluster element (102) and a second cluster element (104, 106). The system classifies the scheduled tasks into groups based on the resource requirements of each task (602). The system then determines the number of backup tasks (528-532) to generate, choosing the number necessary to guarantee that the scheduled tasks satisfy the target probability for execution (800). The backup tasks are desirably identical for a given group, and each backup task can replace any scheduled task in that group.
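The abstract does not say how the number of backup tasks is computed, so the following is only an illustrative sketch under stated assumptions, not the patented method. It assumes that each task in a group is de-scheduled independently with a known probability `p_fail`, that backup tasks themselves are never de-scheduled, and that the target is met when the probability that the backups cover every de-scheduled task in the group reaches `target`. The function names, the `(name, cpu, ram_gb)` task tuple, and the binomial model are all hypothetical.

```python
from collections import defaultdict
from math import comb


def group_by_requirements(tasks):
    """Group task names by identical resource requirements.

    `tasks` is an iterable of (name, cpu, ram_gb) tuples; this shape is
    an assumption made for the example, not something the source defines.
    """
    groups = defaultdict(list)
    for name, cpu, ram_gb in tasks:
        groups[(cpu, ram_gb)].append(name)
    return dict(groups)


def backups_needed(n_tasks, p_fail, target):
    """Return the smallest k such that P(at most k tasks fail) >= target.

    Assumed model: each of the n_tasks fails independently with
    probability p_fail, so the failure count is binomially distributed.
    """
    cumulative = 0.0
    for k in range(n_tasks + 1):
        # Add the probability that exactly k tasks are de-scheduled.
        cumulative += comb(n_tasks, k) * p_fail**k * (1 - p_fail) ** (n_tasks - k)
        if cumulative >= target:
            return k
    # Rounding can keep the sum just below a target of 1.0;
    # one backup per task always suffices under this model.
    return n_tasks


if __name__ == "__main__":
    tasks = [("a", 1, 2), ("b", 1, 2), ("c", 4, 8)]
    for requirements, names in group_by_requirements(tasks).items():
        k = backups_needed(len(names), p_fail=0.05, target=0.99)
        print(requirements, names, "backups:", k)
```

Because every task in a group has the same resource requirements, one identical backup can stand in for any of them, which is why the sketch sizes the backup pool per group instead of per task.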