首页> 外文会议>2011 17th IEEE International Conference on Parallel and Distributed Systems >A Failure Detection Service for Internet-Based Multi-AS Distributed Systems
【24h】

A Failure Detection Service for Internet-Based Multi-AS Distributed Systems

机译:基于Internet的Multi-AS分布式系统的故障检测服务

获取原文

摘要

Failure detectors are one of the basic building blocks of fault-tolerant distributed systems. A failure detector is a distributed oracle that provides information about the state of processes of a distributed system. This work presents a failure detector service for Internet-based distributed systems that span multiple autonomous systems. The service is based on monitors which are capable of providing global process state information through a SNMP interface. A monitor executes on each network where processes are monitored. Monitors at different networks communicate across the Internet using Web Services. The system was implemented and evaluated for monitored processes running both at a single LAN and distributed throughout the world in Planet Lab. Experimental results are presented, showing CPU usage, failure detection latency, and mistake rate.
机译:故障检测器是容错分布式系统的基本组成部分之一。故障检测器是一种分布式预言机,它提供有关分布式系统进程状态的信息。这项工作为跨多个自治系统的基于Internet的分布式系统提供了故障检测器服务。该服务基于能够通过SNMP接口提供全局过程状态信息的监视器。监视程序在监视进程的每个网络上执行。不同网络中的监视器使用Web服务在Internet上进行通信。该系统是针对在单个LAN上运行并在Planet Lab中分布在全球的受监视过程进行实施和评估的。给出了实验结果,显示了CPU使用率,故障检测延迟和错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号