首页> 外文学位 >Efficient and secure deduplication for cloud-based backups.
【24h】

Efficient and secure deduplication for cloud-based backups.

机译:针对基于云的备份的高效,安全的重复数据删除。

获取原文
获取原文并翻译 | 示例

摘要

Backup storage based on cloud service is becoming increasingly popular. Deduplication is a key technique that reduces the transmission and storage overhead of backing up large datasets by identifying multiple copies of redundant data.;Elasticity is the ability to scale computing resources such as memory on-demand, and is one of the main advantages of utilizing cloud computing services. With the increasing popularity of cloud based storage, it is natural that more deduplication based storage systems will be migrated to the cloud. Existing deduplication systems however, do not adequately take advantage of elasticity.;In this thesis, we illustrate how to use elasticity to improve deduplication based systems, and propose EAD (elasticity aware deduplication), an indexing algorithm that uses the ability to dynamically increase memory resources to improve overall deduplication performance. Our experimental results indicate that EAD is able to detect more than 98% of all duplicate data, however only consumes less than 5% of expected memory space. Meanwhile, it claims four times of deduplication efficiency than the state-of-art sampling technique while costs less than half of the amount of memory.;Furthermore, as the data growing rapidly in data centers, single-node storage node is no longer be able to provide the corresponding throughput and capacities as expected. Building deduplication clusters is considered as a promising strategy to leverage such bottle-neck on single-node system. However, deduplication relies on how much the system knows about information of previous stored data. The single-node system obviously obtains all such information and is able to detect duplicate data there; however storage nodes in cluster-based system cannot know information on other nodes. It is nontrivial to route data intelligently enough so that the system could support deduplication performance comparable to that of a single-node system, while also at a trivial cost. Thus, we propose an elastic data routing strategy, aiming to achieve deduplication performance comparable to state-of-the-art, while require much less computation resources.;To step further, deduplication as it is currently adopted by cloud backup providers is vulnerable to side-channel attacks. Traditional defenses in cloud computing can prevent such attacks, but are cannot be use together with deduplication. Therefore, we explore the impact of encryption on data uploads to the cloud as well as proposing a solution for cloud-based backup services that combines deduplication and encryption to provide both security and high bandwidth and efficiency. Extensive experiments on real world dataset shows that our solution incurs a small overhead compared to native deduplication while offering strong security protections.
机译:基于云服务的备份存储变得越来越流行。重复数据删除是一项关键技术,它通过识别冗余数据的多个副本来减少备份大型数据集的传输和存储开销。;弹性是按需扩展内存等计算资源的能力,并且是利用的主要优势之一云计算服务。随着基于云的存储的日益普及,更多基于重复数据删除的存储系统自然会迁移到云中。但是,现有的重复数据删除系统没有充分利用弹性。在本文中,我们说明了如何使用弹性来改进基于重复数据删除的系统,并提出了EAD(弹性感知重复数据删除)索引技术,该索引算法使用了动态增加内存的能力资源,以提高整体重复数据删除性能。我们的实验结果表明,EAD能够检测到所有重复数据的98%以上,但是仅消耗了不到5%的预期存储空间。同时,它的重复数据删除效率是最先进的采样技术的四倍,而成本却不到内存的一半。此外,由于数据中心中数据的快速增长,单节点存储节点已不再适用。能够提供预期的相应吞吐量和容量。建立重复数据删除群集被认为是在单节点系统上利用这种瓶颈的有前途的策略。但是,重复数据删除取决于系统对先前存储的数据的信息了解的程度。单节点系统显然可以获取所有此类信息,并且能够在那里检测到重复数据。但是,基于集群的系统中的存储节点无法知道其他节点上的信息。足够智能地路由数据非常重要,因此系统可以支持与单节点系统相当的重复数据删除性能,同时成本也很低。因此,我们提出了一种弹性数据路由策略,旨在实现可与最新技术相媲美的重复数据删除性能,同时所需的计算资源要少得多。进一步,云备份提供商当前采用的重复数据删除容易受到攻击。旁道攻击。云计算中的传统防御可以防止此类攻击,但不能与重复数据删除一起使用。因此,我们探讨了加密对将数据上传到云的影响,并提出了针对基于云的备份服务的解决方案,该解决方案结合了重复数据删除和加密功能以提供安全性,高带宽和效率。在现实世界数据集上进行的大量实验表明,与本机重复数据删除相比,我们的解决方案产生了少量开销,同时提供了强大的安全保护。

著录项

  • 作者

    Wang, Yufeng.;

  • 作者单位

    Temple University.;

  • 授予单位 Temple University.;
  • 学科 Computer science.
  • 学位 M.S.
  • 年度 2015
  • 页码 87 p.
  • 总页数 87
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:52:51

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号