首页> 外文期刊>Concurrency and computation: practice and experience >A decentralized redundancy generation scheme for codeswith locality in distributed storage systems
【24h】

A decentralized redundancy generation scheme for codeswith locality in distributed storage systems

机译:代码分散的冗余生成方案随着分布式存储系统的位置

获取原文
获取原文并翻译 | 示例

摘要

The increasing data volume in a large number of applications presents a dire need for supportingthe reliable datamanagement in distributed storage systems. Existing classical erasure codes,such as the Reed-Solomon codes and locally reconstruction codes, are widely adopted by manydistributed storage systems. However, existing researches mainly focus on proposing new optimizedcodes, ignoring the optimization of the encoding process with the classical codes, whereinefficient encoding process greatly degrades the encoding performance of the distributed storagesystems. Thus, how to complete the encoding process in an efficient way has become thechallenge for adopting the classical codes. In this paper, we propose a decentralized redundancygeneration scheme on the basis of the codes with locality, called D2CP, where a 2-step frameworkis proposed to support both the data patterns (replication to encoding and direct encoding) andcodeswith locality with any parameter set. For improving the insertion throughput,D2CP adoptsa data placement techniquewith consistent hashing to guide the selection of nodes. For reducingthe network traffic cost,D2CP adopts a data sending scheduling technique to schedule the transmissionof the source nodes and a cooperative parity generation technique to generate the paritydata cooperatively. To evaluate the performance of D2CP, we conduct experiments on our RAIDdistributed storage system under various parameter settings with both 30 physical and 200 virtualservers. Extensive experiments confirm that D2CP can improve the encoding throughput by20% and 32% and reduce the network traffic cost by 16% and 33% compared with the typicalapproaches on average for the 2 data patterns respectively
机译:大量应用中增加的数据量呈现了可靠的支持需求分布式存储系统中的可靠性数据管理。现有的古典擦除代码,如芦苇所罗门代码和局部重建代码,许多人被广泛采用分布式存储系统。然而,现有研究主要集中在提出新优化代码,忽略与经典代码的编码过程的优化,其中效率低下编码过程大大降低了分布式存储的编码性能系统。因此,如何以有效的方式完成编码过程已成为采用经典代码的挑战。在本文中,我们提出了分散的冗余基于具有局部性的代码的代码,称为D2CP,其中2步框架建议支持数据模式(复制到编码和直接编码)和带有任何参数集的位置。为了提高插入吞吐量,D2CP采用数据放置技术一致散列,以指导节点的选择。减少网络流量成本,D2CP采用数据发送调度技术来安排传输生成奇偶校验的源节点和合作奇偶校验生成技术数据合作。为了评估D2CP的表现,我们对我们的RAID进行实验具有30个物理和200个虚拟的各种参数设置下的分布式存储系统服务器。广泛的实验证实,D2CP可以通过以下方式改善编码吞吐量与典型相比,20%和32%,降低网络流量成本16%和33%平均接近2个数据模式

著录项

  • 来源
  • 作者单位

    National Key Laboratory for Parallel andDistributed Processing College of ComputerNational University of Defense Technology Changsha Hunan China;

    National Key Laboratory for Parallel andDistributed Processing College of ComputerNational University of Defense Technology Changsha Hunan China;

    National Key Laboratory for Parallel andDistributed Processing College of ComputerNational University of Defense Technology Changsha Hunan China;

    National Key Laboratory for Parallel andDistributed Processing College of ComputerNational University of Defense Technology Changsha Hunan China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    decentralized; erasure codes; encoding process; network traffic cost;

    机译:分散;擦除代码;编码过程;网络流量成本;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号