首页> 外文会议>International Conference on Software Security and Reliability >An Empirical Study on Data Retrievability in Decentralized Erasure Code Based Distributed Storage Systems
【24h】

An Empirical Study on Data Retrievability in Decentralized Erasure Code Based Distributed Storage Systems

机译:基于分散式擦除码的分布式存储系统中数据可检索性的实证研究

获取原文

摘要

Erasure codes are applied in distributed storage systems to provide data robustness against server failures by storing data redundancy among many storage servers. A (n, k) erasure code encodes a data object, which is represented as k elements, into a codeword of n elements such that any k out of these n codeword elements can recover the data object back. Decentralized erasure codes are proposed for distributed storage systems without a central authority. The characteristic of decentralization makes resulting storage systems more scalable and suitable for loosely-organized networking environments. However, different from conventional erasure codes, decentralized erasure codes trade some probability of a successful data retrieval for decentralization. Although theoretical lower bounds on the probability are overwhelming from a theoretical aspect, it is essential to know what the data retrievability is in real applications from a practical aspect. We focus on decentralized erasure code based storage systems and investigate data retrievability from both theoretical and practical aspects. We conduct simulation for random processes of storage systems to evaluate data retrievability. Then we compare simulation results and analytical values from theoretical bounds. By our comparison, we find that data retrievability is underestimated by those bounds. Data retrievability is over 99% in most cases in our simulations, where the order of the used finite field is an 8-bit prime. Data retrievability can be enlarged by using a larger finite field. We believe that data retrievability of decentralized erasure code based storage systems is acceptable for real applications.
机译:通过在许多存储服务器之间存储数据冗余,在分布式存储系统中应用擦除代码,以提供针对服务器故障的数据健壮性。 (n,k)个擦除码将表示为k个元素的数据对象编码为n个元素的代码字,以使这n个代码字元素中的任何k个都可以恢复该数据对象。对于没有中央授权的分布式存储系统,提出了分散式擦除代码。分散的特性使最终的存储系统具有更高的可扩展性,并适合于松散组织的网络环境。然而,与传统的擦除码不同,分散的擦除码以某种成功的概率为分散进行了数据检索。尽管从理论上讲,概率的理论下限是压倒性的,但从实践的角度来了解实际应用中的数据可检索性是至关重要的。我们专注于基于分散式擦除码的存储系统,并从理论和实践两个方面研究数据的可检索性。我们对存储系统的随机过程进行仿真,以评估数据的可检索性。然后,我们从理论范围比较仿真结果和分析值。通过比较,我们发现这些界限低估了数据的可检索性。在我们的仿真中,大多数情况下数据可检索性超过99%,其中使用的有限域的顺序是8位素数。可以通过使用更大的有限域来扩大数据可检索性。我们认为,基于分散式擦除代码的存储系统的数据可检索性对于实际应用是可以接受的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号