Journal: IEEE Transactions on Computers

Efficient Deduplication Techniques for Modern Backup Operation



Abstract

In this work, we optimize the deduplication system by tuning the factors that govern fingerprint lookup and chunking, which we identify as the key ingredients of efficient deduplication. For efficient fingerprint lookup, we propose a fingerprint management scheme called LRU-based Index Partitioning. For efficient chunking, we propose the Incremental Modulo-K (INC-K) algorithm, an optimized version of Rabin's algorithm that significantly reduces the number of arithmetic operations by exploiting the algebraic properties of modular arithmetic. LRU-based Index Partitioning uses the notion of a tablet and enforces access locality of fingerprint lookups when storing fingerprints. We maintain tablets in an LRU manner to exploit the temporal locality of fingerprint lookups. To preserve access correlation across tablets, we apply prefetching when maintaining the tablet list. We propose Context-aware chunking to maximize chunking speed and deduplication ratio. We develop a prototype backup system and perform a comprehensive analysis of various factors and their relationships: average chunk size, chunking speed, deduplication ratio, tablet management algorithms, and overall backup speed. When the average chunk size is increased from 4 KB to 10 KB, chunking time increases by 34.3 percent, the deduplication ratio decreases by 0.66 percent, and the overall backup speed increases by 50 percent (from 51.4 MB/sec to 77.8 MB/sec).
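To make the chunking idea in the abstract concrete, the following is a minimal sketch of content-defined chunking with a rolling polynomial hash that is updated incrementally under a modulus. It is not the paper's INC-K algorithm, whose details the abstract does not give; it only illustrates the general idea of updating a Rabin-style hash in constant time per byte. The names `split_chunks` and `fingerprint`, the constants `WINDOW`, `BASE`, and `MOD`, and the size limits are all assumptions chosen for illustration.

```python
import hashlib

WINDOW = 48           # sliding-window length in bytes (assumed value)
BASE = 257            # polynomial base (assumed value)
MOD = (1 << 31) - 1   # modulus for the rolling hash (assumed value)
# BASE^(WINDOW-1) mod MOD: weight of the byte that leaves the window.
TOP = pow(BASE, WINDOW - 1, MOD)


def split_chunks(data, avg_size=4096, min_size=1024, max_size=16384):
    """Split data into variable-size chunks at content-defined boundaries.

    avg_size must be a power of two; a boundary is declared when the low
    bits of the rolling hash are all ones, or when max_size is reached.
    """
    mask = avg_size - 1
    chunks = []
    start = 0   # offset where the current chunk begins
    h = 0       # rolling hash of the last WINDOW bytes of the current chunk
    for i, b in enumerate(data):
        if i - start >= WINDOW:
            # Incremental update: drop the oldest byte's contribution
            # instead of rehashing the whole window.
            h = (h - data[i - WINDOW] * TOP) % MOD
        h = (h * BASE + b) % MOD
        length = i - start + 1
        if (length >= min_size and (h & mask) == mask) or length >= max_size:
            chunks.append(data[start:i + 1])
            start = i + 1
            h = 0
    if start < len(data):
        chunks.append(data[start:])  # trailing partial chunk
    return chunks


def fingerprint(chunk):
    """Return a chunk fingerprint (SHA-1 here, as an assumed choice)."""
    return hashlib.sha1(chunk).hexdigest()
```

A deduplicating backup would look up `fingerprint(chunk)` in its index and store only chunks whose fingerprint is new. Raising `avg_size` produces fewer, larger chunks, which is the chunk-size trade-off the abstract measures.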
