首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >I/O-Efficient Scaling Schemes for Distributed Storage Systems with CRS Codes
【24h】

I/O-Efficient Scaling Schemes for Distributed Storage Systems with CRS Codes

机译:具有CRS代码的分布式存储系统的I / O有效扩展方案

获取原文
获取原文并翻译 | 示例

摘要

System scaling becomes essential and indispensable for distributed storage systems due to the explosive growth of data volume. Considering that fault-protection is a necessity in large-scale distributed storage systems, and Cauchy Reed-Solomon (CRS) codes are widely deployed to tolerate multiple simultaneous node failures, this paper studies the scaling problem of distributed storage systems with CRS codes. In particular, we formulate the scaling problem with an optimization model in which both the post-scaling encoding matrix and the data migration policy are assumed to be unknown in advance. To minimize the I/O overhead, we propose a three-phase optimization scaling scheme for CRS codes. Specifically, we first derive the optimal post-scaling encoding matrix under a given data migration policy, then optimize the data migration process using the selected post-scaling encoding matrix, and finally exploit the Maximum Distance Separable (MDS) property to further optimize the designed data migration process. Our scaling scheme requires minimal data movement while achieving uniform data distribution. Moreover, it requires to read fewer data blocks than conventional minimum data migration schemes, but still guarantees the minimum amount of migrated data. To validate the efficiency of our scheme, we implement it atop a networked file system. Extensive experiments show that our scaling scheme uses less scaling time than the basic scheme.
机译:由于数据量的爆炸性增长,系统扩展对于分布式存储系统而言必不可少且不可或缺。考虑到大型分布式存储系统中故障保护的必要性,并且广泛使用柯西·里德-所罗门(Cuchy Reed-Solomon)代码来容忍多个同时发生的节点故障,本文研究了使用CRS代码的分布式存储系统的扩展性问题。特别是,我们用优化模型来表示缩放问题,在该模型中,假定缩放后编码矩阵和数据迁移策略都事先未知。为了最小化I / O开销,我们为CRS代码提出了一个三相优化缩放方案。具体来说,我们首先根据给定的数据迁移策略导出最佳的后缩放编码矩阵,然后使用选定的后缩放编码矩阵优化数据迁移过程,最后利用最大距离可分离性(MDS)属性进一步优化设计的数据迁移过程。我们的扩展方案要求最小的数据移动,同时实现均匀的数据分发。而且,与传统的最小数据迁移方案相比,它需要读取更少的数据块,但仍保证了最小量的迁移数据。为了验证我们方案的效率,我们在网络文件系统上实现了该方案。大量实验表明,我们的缩放方案比基本方案使用更少的缩放时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号