首页> 外文会议>IEEE International Conference on Big Data Science and Engineering >DedupeSwift: Object-Oriented Storage System Based on Data Deduplication
【24h】

DedupeSwift: Object-Oriented Storage System Based on Data Deduplication

机译:DEDUPESWIFT:基于数据重复数据删除的面向对象的存储系统

获取原文

摘要

Recent years have witnessed the explosion of the data universe. Facing the rapid growth of the data size, cloud storage is proposed as an approach to provide cost-efficient and reliable data storage service. As data size grows, data centers providing cloud storage service need more storage resources to meet the ever-increasing requirements. Data deduplication is a technology aiming to remove redundant data blocks. It has been used to reduce the storage footprint of backup and archival systems. In this paper, we propose DedupeSwift, which is based on OpenStack Swift, an open-source object-oriented storage software widely used in public and private clouds. Data deduplication is introduced to reduce the storage overhead. To deal with the performance overhead brought by deduplication, a lazy method is introduced to reduce the disk I/O bottleneck. Compression and caching are also used in the system to improve the read performance. Experimental results show that our proposed DedupeSwift can reduce the storage overhead by 65.24% and 89.84% on the two data sets with favorable upload and download throughput.
机译:近年来见证了数据宇宙的爆炸。面对数据大小的快速增长,云存储被提出为提供具有成本效益和可靠的数据存储服务的方法。随着数据大小的增长,提供云存储服务的数据中心需要更多存储资源来满足不断增加的要求。数据重复数据删除是一种旨在删除冗余数据块的技术。它已被用来减少备份和档案系统的存储空间。在本文中,我们提出了一种基于OpenStack Swift的Dedupeswift,这是一个广泛用于公共和私有云的开源面向对象的存储软件。引入数据重复数据删除以减少存储开销。要处理重复数据删除带来的性能开销,引入了一种延迟方法以减少磁盘I / O瓶颈。在系统中也使用压缩和缓存以提高读取性能。实验结果表明,我们提出的Dedupeswift可以将存储开销降低65.24%和89.84%,两个数据集具有有利上传和下载吞吐量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号