首页> 外文会议>IEEE 35th Annual IEEE International Conference on Computer Communications >On the synchronization bottleneck of OpenStack Swift-like cloud storage systems
【24h】

On the synchronization bottleneck of OpenStack Swift-like cloud storage systems

机译:类似于OpenStack Swift的云存储系统的同步瓶颈

获取原文
获取原文并翻译 | 示例

摘要

As one type of the most popular cloud storage services, OpenStack Swift and its follow-up systems replicate each data object across multiple storage nodes and leverage object sync protocols to achieve high availability and eventual consistency. The performance of object sync protocols heavily relies on two key parameters: r (number of replicas for each object) and η (number of objects hosted by each storage node). In existing tutorials and demos, the configurations are usually r = 3 and n 3 and n ≫ 1000, the object sync process is significantly delayed and produces massive network overhead. This phenomenon is referred to as the sync bottleneck problem. Then, to explore the root cause, we review the source code of OpenStack Swift and find that its object sync protocol utilizes a fairly simple and network-intensive approach to check the consistency among replicas of objects. In particular, each storage node is required to periodically multicast the hash values of all its hosted objects to all the other replica nodes. Thus in a sync round, the number of exchanged hash values per node is Θ(n×r). Further, to tackle the problem, we propose a lightweight object sync protocol called LightSync. It remarkably reduces the sync overhead by using two novel building blocks: 1) Hashing of Hashes, which aggregates all the h hash values of each data partition into a single but representative hash value with the Merkle tree; 2) Circular Hash Checking, which checks the consistency of different partition replicas by only sending the aggregated hash value to the clockwise neighbor. Its design provably reduces the per-node network overhead from Θ(n×r) to Θ(n/h). In addition, we have implemented LightSync as an open-source patch and adopted it to OpenStack Swift, thus reducing sync delay by up to 28.8× and network overhead by up to 14.2×.
机译:作为最流行的云存储服务的一种,OpenStack Swift及其后续系统跨多个存储节点复制每个数据对象,并利用对象同步协议来实现高可用性和最终的一致性。对象同步协议的性能在很大程度上取决于两个关键参数:r(每个对象的副本数)和η(每个存储节点托管的对象数)。在现有的教程和演示中,配置​​通常为r = 3且n 3且n≤1000,对象同步过程显着延迟,并产生大量网络开销。这种现象称为同步瓶颈问题。然后,为了探究根本原因,我们回顾了OpenStack Swift的源代码,发现其对象同步协议利用一种相当简单且占用大量网络资源的方法来检查对象副本之间的一致性。特别是,需要每个存储节点定期将其所有托管对象的哈希值多播到所有其他副本节点。因此,在同步回合中,每个节点交换的哈希值的数量为Θ(n×r)。此外,为解决该问题,我们提出了一种称为LightSync的轻量级对象同步协议。它通过使用两个新颖的构建块显着减少了同步开销:1)哈希散列,它通过Merkle树将每个数据分区的所有h散列值聚合为单个但有代表性的散列值; 2)循环哈希检查,它通过仅将聚合哈希值发送到顺时针邻居来检查不同分区副本的一致性。它的设计可证明将每个节点的网络开销从Θ(n×r)减少到Θ(n / h)。此外,我们已经将LightSync实施为一个开放源代码补丁,并将其用于OpenStack Swift,从而将同步延迟降低了28.8倍,将网络开销降低了14.2倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号