In this paper, we design and implement a fault-tolerant cluster management tool which can monitor the status of the nodes in a cluster DBMS (Database Management System) as well as the status of the various DBMS instances at a given node. Based on the monitoring, our cluster management tool can provide an intelligent fail-over mechanism for dealing with multiple node failures in a cluster DBMS, thus making its fault-tolerance possible when combined with a load balancer. We implement our cluster management tool by using Linux virtual server 0.81 as a load balancer and iBASE/Cluster as a cluster-based DBMS, under a cluster system with four server nodes. It is shown that the sensing and recovery time of our cluster management tool for the iBASE/Cluster is about 2 seconds while that of OCMS for Oracle DBMS is more than 20 seconds. Finally we show that our cluster management tool can support the nonstop and self-managing of cluster DBMS with its intelligent fail-over mechanism even in case of multiple node failures.
展开▼
机译:在本文中,我们设计并实现了容错群集管理工具,可以监视群集DBMS(数据库管理系统)中节点的状态以及给定节点处的各种DBMS实例的状态。基于监控,我们的群集管理工具可以提供智能故障转移机制,用于处理群集DBMS中的多个节点故障,从而与负载均衡器组合时可能的容错。在具有四个服务器节点的群集系统下,我们使用Linux Virtual Server 0.81作为负载均衡器和IBase / Cluster来实现我们的群集管理工具。结果表明,IBASE /群集的群集管理工具的感应和恢复时间约为2秒,而ORACLE DBMS的OCMS的ICM是大约20秒。最后,我们表明,即使在多个节点故障的情况下,我们的群集管理工具也可以通过其智能故障转移机制支持群集DBMS的不间断和自我管理。
展开▼