On the synchronization bottleneck of OpenStack Swift-like cloud storage systems

Abstract

As one of the most popular types of cloud storage services, OpenStack Swift and its follow-up systems replicate each data object across multiple storage nodes and leverage object sync protocols to achieve high availability and eventual consistency. The performance of object sync protocols heavily relies on two key parameters: r (number of replicas for each object) and n (number of objects hosted by each storage node). In existing tutorials and demos, the configurations are usually r = 3 and n < 1000. In data-intensive scenarios, however, e.g., when r > 3 and n ≫ 1000, the object sync process is significantly delayed and produces massive network overhead. This phenomenon is referred to as the sync bottleneck problem. To explore the root cause, we review the source code of OpenStack Swift and find that its object sync protocol uses a fairly simple and network-intensive approach to check the consistency among replicas of objects. In particular, each storage node is required to periodically multicast the hash values of all its hosted objects to all the other replica nodes. Thus, in a sync round, the number of exchanged hash values per node is Θ(n×r). To tackle the problem, we propose a lightweight object sync protocol called LightSync. It remarkably reduces the sync overhead by using two novel building blocks: 1) Hashing of Hashes, which aggregates all the h hash values of each data partition into a single but representative hash value with the Merkle tree; 2) Circular Hash Checking, which checks the consistency of different partition replicas by only sending the aggregated hash value to the clockwise neighbor. Its design provably reduces the per-node network overhead from Θ(n×r) to Θ(n/h). In addition, we have implemented LightSync as an open-source patch and applied it to OpenStack Swift, reducing sync delay by up to 28.8× and network overhead by up to 14.2×.
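The two building blocks named in the abstract can be illustrated with a minimal Python sketch. It is not the LightSync patch itself: the function names (`digest`, `merkle_root`, `circular_check`), the choice of SHA-256, and the rule of duplicating the last hash on odd-sized levels are all assumptions made for illustration.

```python
import hashlib


def digest(data: bytes) -> str:
    # Hash function is an assumption for this sketch; any stable hash works.
    return hashlib.sha256(data).hexdigest()


def merkle_root(leaf_hashes: list[str]) -> str:
    """Hashing of Hashes: fold the h object hashes of one partition
    into a single representative hash by pairwise aggregation."""
    if not leaf_hashes:
        return digest(b"")
    level = list(leaf_hashes)
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # pad odd levels (illustrative rule)
        level = [
            digest((level[i] + level[i + 1]).encode())
            for i in range(0, len(level), 2)
        ]
    return level[0]


def circular_check(replica_roots: list[str]) -> list[int]:
    """Circular Hash Checking: replica i sends its aggregated hash only to
    its clockwise neighbor (i + 1) % r. Returns the indices of the
    receivers that observe a mismatch."""
    r = len(replica_roots)
    return [
        (i + 1) % r
        for i in range(r)
        if replica_roots[i] != replica_roots[(i + 1) % r]
    ]


if __name__ == "__main__":
    objects = [b"a", b"b", b"c", b"d"]
    fresh = merkle_root([digest(o) for o in objects])
    stale = merkle_root([digest(o) for o in [b"a", b"b", b"X", b"d"]])
    # Replica 2 holds an outdated object; its two ring neighbors notice.
    print(circular_check([fresh, fresh, stale]))  # [2, 0]
```

In this sketch each replica sends one aggregated value per partition to a single neighbor, instead of sending all h object hashes to each of the other r - 1 replicas, which is the intuition behind the reduction from Θ(n×r) to Θ(n/h) stated above.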

Bibliographic record

Source
IEEE 35th Annual IEEE International Conference on Computer Communications | 2016 | pp. 1-9 | 9 pages
Conference location: San Francisco, CA (US)
Authors
Thierry Titcheu Chekam; Ennan Zhai; Zhenhua Li; Yong Cui; Kui Ren;
Author affiliations

School of Software, TNLIST, and KLISS MoE, Tsinghua University;

Department of Computer Science, Yale University;

School of Software, TNLIST, and KLISS MoE, Tsinghua University;

Department of Computer Science and Technology, Tsinghua University;

Department of Computer Science and Engineering, SUNY Buffalo;

Original format: PDF
Text language: English
Keywords
Synchronization; Cloud computing; Protocols; Delays; Servers; Open source software; Data models;

Similar literature

1. On the Synchronization Bottleneck of OpenStack Swift-Like Cloud Storage Systems [J]. Mingkang Ruan, Thierry Titcheu, Ennan Zhai. Parallel and Distributed Systems, IEEE Transactions on. 2018, No. 9
2. Performance Evaluation of RSA-based Secure Cloud Storage Protocol using OpenStack [J]. M. F. Hyder, S. Tooba, Waseemullah. Engineering Technology and Applied Science Research. 2021, No. 4
3. Parity Data De-Duplication in All Flash Array-Based OpenStack Cloud Block Storage [J]. Huiseong HEO, Cheongjin AHN, Deok-Hwan KIM. IEICE Transactions on Information and Systems. 2016, No. 5
4. On the Synchronization Bottleneck of OpenStack Swift-like Cloud Storage Systems [C]. Thierry Titcheu Chekam, Ennan Zhai, Zhenhua Li. Annual IEEE International Conference on Computer Communications. 2016
5. Designing Storage and Privacy-Preserving Systems for Large-Scale Cloud Applications [D]. Szekeres, Adriana. 2020
6. A Systematic Review on Cloud Storage Mechanisms Concerning e-Healthcare Systems [O]. Adnan Tahir, Fei Chen, Habib Ullah Khan. 2020
7. Building an Object Cloud Storage Service System using OpenStack Swift [O]. Sridevi Bonthu, Y S S R Murthy, M. Srilakshmi. 2014
