Mobile agent's execution are prone to failures, originating from bad communication, security attacks, agent server crashes, system resources unavailability, network congestion or even deadlock situations. In such events, mobile agents either get lost or damaged (partially or totally) during execution. Making mobile agents fault tolerant is a measure taken to increase the dependability and reliability of agent-based application. Many approaches have been proposed but majority of the existing mobile agent's fault tolerance implementations are designed to either tolerate one of the fault classes or two (such as communication, crash and agent software failure) but not all in any situation. This perhaps, makes it impossible to detect and recover from failures of all types. In this paper, based on the analysis of existing fault tolerance approaches, we proposed a generic fault tolerance framework that consists of a monitoring, planning and recovery process execution phases that can help tolerate failures of all type. The framework is validated using existing implementations approaches.
展开▼