Fault-tolerant replication management in large-scale distributed storage systems

机译：大型分布式存储系统中的容错复制管理

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Failures of all forms happen: from losing single network packets to site-wide disasters. Since businesses rely heavily on their data, it is imperative that failures require minimal time and effort to repair and that the service interruption during the failure or repair period should be as short as possible. To this end, the ideal system should repair itself relying on humans only when absolutely necessary in the repair process. This paper describes one component of a self-healing storage system: the component that allows for automatic recovery of access to data when the power comes back on after a large-scale outage. Our failure recovery, protocol is part of a suite of modular protocols that make up the Palladio distributed storage system. This protocol guarantees that service will be repaired quickly and automatically when enough failures are repaired.

机译：各种形式的故障都会发生：从丢失单个网络数据包到站点范围的灾难。由于企业严重依赖其数据，因此必须以最少的时间和精力来进行故障修复，并且在故障或修复期间的服务中断应尽可能短。为此，理想的系统应该仅在修复过程中绝对必要时才依靠人类进行自我修复。本文介绍了自我修复存储系统的一个组件：该组件允许在大规模停电后重新打开电源时自动恢复对数据的访问。我们的故障恢复协议是构成Palladio分布式存储系统的模块化协议套件的一部分。此协议保证在修复足够的故障后，将自动快速修复服务。

著录项

来源
《Reliable Distributed Systems, 1999. Proceedings of the 18th IEEE Symposium on》|1999年|P.144-155|共12页
会议地点
作者
Golding; R.; Borowsky; E.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. On fault-tolerant data replication in distributed systems [J] . Fathi Tenzekhti, Khaled Day, Mohamed Ould-Khaoua Microprocessors and microsystems . 2002,第7期

机译：关于分布式系统中的容错数据复制
2. Module replication for fault-tolerant real-time distributed systems [J] . Varvarigou T.A., Trotter J. IEEE Transactions on Reliability . 1998,第1期

机译：容错实时分布式系统的模块复制
3. Module replication for fault-tolerant real-time distributed systems [J] . Varvarigou T.A., Trotter J. IEEE Transactions on Reliability . 1998,第1期

机译：容错实时分布式系统的模块复制
4. Fault-tolerant replication management in large-scale distributed storage systems [C] . Richard Golding, Elizabeth Borowsky IEEE Symposium on Reliable Distributed Systems . 1999

机译：大规模分布式存储系统中容错复制管理
5. Game theoretical data replication techniques for large-scale autonomous distributed computing systems. [D] . Khan, Samee Ullah. 2007

机译：大型自治分布式计算系统的博弈论数据复制技术。
6. Designing fault-tolerant distributed archives for picture archiving and communication systems [O] . Rebecca Mendenhall, Matt Dewey, Ian Soutar 2001

机译：设计用于图像存档和通信系统的容错分布式档案
7. Low Cost Management of Replicated Data in Fault-Tolerant Distributed Systems [O] . Birman, Kenneth P., Joseph, Thomas A. 1984

机译：容错分布式系统中复制数据的低成本管理

Fault-tolerant replication management in large-scale distributed storage systems

摘要

著录项

相似文献

相关主题

期刊订阅