首页> 外文期刊>Computer networks >Lifetime and availability of data stored on a P2P system: Evaluation of redundancy and recovery schemes
【24h】

Lifetime and availability of data stored on a P2P system: Evaluation of redundancy and recovery schemes

机译:P2P系统上存储的数据的生命周期和可用性:评估冗余和恢复方案

获取原文
获取原文并翻译 | 示例

摘要

This paper studies the performance of Peer-to-Peer storage and backup systems (P2PSS). These systems are based on three pillars: data fragmentation and dissemination among the peers, redundancy mechanisms to cope with peers churn and repair mechanisms to recover lost or temporarily unavailable data. Usually, redundancy is achieved either by using replication or by using erasure codes. A new class of network coding (regenerating codes) has been proposed recently. Therefore, we will adapt our work to these three redundancy schemes. We introduce two mechanisms for recovering lost data and evaluate their performance by modeling them through absorbing Markov chains. Specifically, we evaluate the quality of service provided to users in terms of durability and availability of stored data for each recovery mechanism and deduce the impact of its parameters on the system performance. The first mechanism is centralized and based on the use of a single server that can recover multiple losses at once. The second mechanism is distributed: reconstruction of lost fragments is iterated sequentially on many peers until that the required level of redundancy is attained. The key assumptions made in this work, in particular, the assumptions made on the recovery process and peer on-times distribution, are in agreement with the analysis in and in respectively. The models are thereby general enough to be applicable to many distributed environments as shown through numerical computations. We find that, in stable environments such as local area or research institute networks where machines are usually highly available, the distributed-repair scheme in erasure-coded systems offers a reliable, scalable and cheap storage/backup solution. For the case of highly dynamic environments, in general, the distributed-repair scheme is inefficient, in particular to maintain high data availability, unless the data redundancy is high. Using regenerating codes overcomes this limitation of the distributed-repair scheme. P2PSS with centralized-repair scheme are efficient in any environment but have the disadvantage of relying on a centralized authority. However, the analysis of the overhead cost (e.g. computation, bandwidth and complexity cost) resulting from the different redundancy schemes with respect to their advantages (e.g. simplicity), is left for future work.
机译:本文研究对等存储和备份系统(P2PSS)的性能。这些系统基于三大支柱:对等体之间的数据分段和分发,应对对等体搅动的冗余机制以及修复丢失或暂时不可用的数据的修复机制。通常,通过使用复制或使用擦除代码来实现冗余。最近已经提出了一类新的网络编码(再生代码)。因此,我们将使我们的工作适应这三种冗余方案。我们介绍了两种恢复丢失数据的机制,并通过吸收马尔可夫链对它们进行建模来评估其性能。具体来说,我们根据持久性和每种恢复机制所存储数据的可用性来评估提供给用户的服务质量,并推断出其参数对系统性能的影响。第一种机制是集中式的,并且基于单个服务器的使用,该服务器可以一次恢复多个损失。第二种机制是分布式的:在许多对等点上顺序地迭代丢失片段的重建,直到达到所需的冗余级别。这项工作中做出的关键假设,特别是对恢复过程和同伴按时分配所做的假设分别与in和in中的分析一致。该模型因此足够通用,如数值计算所示,可适用于许多分布式环境。我们发现,在诸如局域网或研究所网络等稳定的环境中,这些机器通常都是高度可用的机器,采用擦除编码的系统中的分布式修复方案可提供可靠,可扩展且廉价的存储/备份解决方案。通常,对于高度动态的环境,除非数据冗余度高,否则分布式修复方案效率低下,尤其是无法维持高数据可用性。使用再生代码克服了分布式修复方案的这一限制。具有集中式修复方案的P2PSS在任何环境中均有效,但是具有依赖集中式授权的缺点。然而,对于不同的冗余方案的优点(例如,简单性),由不同的冗余方案产生的开销成本(例如,计算,带宽和复杂性成本)的分析留给以后的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号