首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Exploiting Pipelined Encoding Process to Boost Erasure-Coded Data Archival
【24h】

Exploiting Pipelined Encoding Process to Boost Erasure-Coded Data Archival

机译:利用流水线编码过程增强擦除编码数据归档

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper addresses an issue of , where () erasure codes are employed to archive rarely accessed replicas. The traditional process neither leverages the existence of replicas, nor handles encoding operations in a decentralized manner. To overcome these drawbacks, we exploit pipelined encoding processes to boost the data archival performance on storage clusters. First, we propose two data layouts called and by applying a chained-declustering mechanism to both and redundancy groups. Second, in light of the and layouts, we design two archiving schemes named and , which exhibit the following three - alient features: (i) exploiting data locality—two or three local blocks are read by each involved node for encoding; (ii) decentralized computation load—encoding operations are distributed among nodes; and (iii) parallel archival processing—two or three encoding pipelines are simultaneously deployed to generate parity blocks. We implement both the and schemes and three existing solutions (i.e., SynE, DE, and RapidRAID) in a real-world storage cluster. Experimental results show that our archival schemes outperform the other three solutions in terms of archiving time by a factor of at least 3.41 in a nine-node storage cluster. The experiments strongly indicate that the performance bottleneck of SynE lies in its block-receiving stage; it is disk I/O rather than network traffic that dominates archiving time for both the DE and RapidRAID schemes.
机译:本文解决的问题,其中()擦除代码用于存档很少访问的副本。传统过程既不利用副本的存在,也不以分散的方式处理编码操作。为了克服这些缺点,我们利用流水线编码过程来提高存储集群上的数据归档性能。首先,我们提出了两个数据布局,称为,并通过对冗余组和冗余组都应用了链式分簇机制。其次,根据和布局,我们设计了两个归档方案,分别命名为和,它们表现出以下三种与众不同的特征:(i)利用数据局部性—每个相关节点读取两个或三个局部块进行编码; (ii)分散的计算负荷-编码操作分布在节点之间; (iii)并行存档处理-同时部署两个或三个编码管道以生成奇偶校验块。我们在实际的存储集群中同时实施和方案以及三个现有解决方案(即SynE,DE和RapidRAID)。实验结果表明,在九节点存储集群中,就归档时间而言,我们的归档方案优于其他三个解决方案,其倍数至少为3.41。实验强烈表明,SynE的性能瓶颈在于其块接收阶段。 DE和RapidRAID方案的归档时间主要是磁盘I / O而不是网络流量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号