
DedupeSwift: Object-Oriented Storage System Based on Data Deduplication


Abstract

Recent years have witnessed explosive growth in the volume of data. Cloud storage has been proposed as a cost-efficient and reliable way to store this data, but as data volumes grow, the data centers that provide cloud storage need ever more storage resources to keep up with demand. Data deduplication is a technology that removes redundant data blocks, and it has been used to reduce the storage footprint of backup and archival systems. In this paper, we propose DedupeSwift, which is based on OpenStack Swift, open-source object-oriented storage software widely used in public and private clouds. DedupeSwift introduces data deduplication to reduce storage overhead. To limit the performance overhead that deduplication brings, a lazy method is introduced to relieve the disk I/O bottleneck. Compression and caching are also used to improve read performance. Experimental results show that DedupeSwift reduces storage overhead by 65.24% and 89.84% on the two data sets, respectively, while maintaining favorable upload and download throughput.
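To make the idea of removing redundant data blocks concrete, the following is a minimal sketch of block-level deduplication. It is not DedupeSwift's implementation: the abstract does not say how objects are chunked or fingerprinted, so the fixed-size chunking, the SHA-256 fingerprints, the in-memory dictionaries, and the names `DedupStore`, `put`, `get`, `stored_bytes`, and `CHUNK_SIZE` are all assumptions chosen for illustration. The lazy method, compression, and caching mentioned above are not modeled.

```python
import hashlib

CHUNK_SIZE = 4096  # hypothetical fixed chunk size; the abstract does not specify one


class DedupStore:
    """Toy in-memory store that keeps each unique chunk only once (illustrative only)."""

    def __init__(self, chunk_size=CHUNK_SIZE):
        self.chunk_size = chunk_size
        self.chunks = {}   # fingerprint -> chunk bytes (unique chunks only)
        self.recipes = {}  # object name -> ordered list of chunk fingerprints

    def put(self, name, data):
        """Split data into fixed-size chunks and store only chunks not seen before."""
        recipe = []
        for start in range(0, len(data), self.chunk_size):
            chunk = data[start:start + self.chunk_size]
            fingerprint = hashlib.sha256(chunk).hexdigest()
            # setdefault leaves an existing entry untouched, so duplicates cost nothing
            self.chunks.setdefault(fingerprint, chunk)
            recipe.append(fingerprint)
        self.recipes[name] = recipe

    def get(self, name):
        """Rebuild an object by concatenating its chunks in recipe order."""
        return b"".join(self.chunks[fp] for fp in self.recipes[name])

    def stored_bytes(self):
        """Physical bytes held after deduplication (chunk payloads only)."""
        return sum(len(chunk) for chunk in self.chunks.values())


if __name__ == "__main__":
    store = DedupStore(chunk_size=4)
    store.put("a", b"AAAABBBBAAAA")
    store.put("b", b"BBBBCCCC")
    logical = len(b"AAAABBBBAAAA") + len(b"BBBBCCCC")
    saved = 1 - store.stored_bytes() / logical
    print(f"logical={logical} physical={store.stored_bytes()} saved={saved:.2%}")
```

In this sketch the storage saving is the fraction of logical bytes that never need to be written, which is the same kind of quantity as the 65.24% and 89.84% reductions reported above, although the paper's exact measurement method is not stated in the abstract.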
