首页>
外国专利>
PROACTIVE FAILURE RECOVERY MODEL FOR DISTRIBUTED COMPUTING
PROACTIVE FAILURE RECOVERY MODEL FOR DISTRIBUTED COMPUTING
展开▼
机译:分布式计算的主动故障恢复模型
展开▼
页面导航
摘要
著录项
相似文献
摘要
This disclosure generally describes methods and systems, including computer-implemented methods, computer-program products, and computer systems, for providing a proactive failure recovery model for distributed computing. One computer-implemented method includes building a virtual tree-like computing structure of a plurality of computing nodes, for each computing node of the virtual tree-like computing structure, performing, by a hardware processor, a node failure prediction model to calculate a mean time between failure (MTBF) associated with the computing node, determining whether to perform a checkpoint of the computing node based on a comparison between the calculated MTBF and a maximum and minimum threshold, migrating a process from the computing node to a different computing node acting as a recovery node, and resuming execution of the process on the different computing node.
展开▼