首页> 外文会议>Data Compression Conference >Combining Deduplication and Delta Compression to Achieve Low-Overhead Data Reduction on Backup Datasets

【24h】

Combining Deduplication and Delta Compression to Achieve Low-Overhead Data Reduction on Backup Datasets

机译：结合重复数据删除和三角形压缩，实现备份数据集的低开销数据减少

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data reduction has become increasingly important in storage systems due to the explosive growth of digital data in the world that has ushered in the big data era. In this paper, we present DARE, a Deduplication-Aware Resemblance detection and Elimination scheme for compressing backup datasets that effectively combines data deduplication and delta compression to achieve high data reduction efficiency at low overhead. The main idea behind DARE is to employ a scheme, call Duplicate-Adjacency based Resemblance Detection (DupAdj), by considering any two data chunks to be similar (i.e., candidates for delta compression) if their respective adjacent data chunks are found to be duplicate in a deduplication system, and then further enhance the resemblance detection efficiency by an improved super-feature approach. Our experimental results based on real-world and synthetic backup datasets show that DARE achieves an additional data reduction by a factor of more than 2 (2X) on top of deduplication with very low overhead while nearly doubling the data restore performance of deduplication-only systems by supplementing delta compression to deduplication.

机译：由于世界上已迎来大数据时代的数字数据的爆炸性增长，数据减少在存储系统中变得越来越重要。在本文中，我们敢于，一种重复数据删除感知的相似性检测和消除方案，用于压缩备份数据集，这些数据集有效地结合了数据重复数据删除和增量压缩，以实现低开销的高数据降低效率。敢于采用一个方案，通过考虑任何两个数据块（即，Delta压缩的候选者），呼叫重复相邻的相互相邻的相似性检测（Dupadj），如果发现它们各自的相邻数据块重复在重复数据删除系统中，然后通过改进的超特征方法进一步提高相似的检测效率。我们基于现实世界和综合备份数据集的实验结果表明，敢于在重复数据删除的顶部额外的数据减少超过2（2倍），同时具有非常低的开销，同时几乎加倍Deaplication Systems的数据恢复性能通过补充Delta压缩重复数据删除。

著录项

来源
《Data Compression Conference》|2014年||共10页
会议地点
作者
Wen Xia; Hong Jiang; Dan Feng; Lei Tian;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.56-53;
关键词
Backup Datasets; DupAdj; big data era;

机译：备份数据集;dupadj;大数据时代;

相似文献

外文文献
中文文献
专利

1. WAN-Optimized Replication of Backup Datasets Using Stream-Informed Delta Compression [J] . PHLIP SHILANE, MARK HUANG, GRANT WALLACE, ACM Transactions on Storage . 2012,第4期

机译：使用流通知的增量压缩进行WAN优化的备份数据集复制
2. Deduplication, Reduction And Compliance - Backup Your Data [J] . Database and network journal . 2008,第4期

机译：重复数据删除，还原和合规性-备份数据
3. Ddelta: A deduplication-inspired fast delta compression approach [J] . Wen Xia, Hong Jiang, Dan Feng, Performance Evaluation . 2014,第sepa期

机译：Delta：一种基于重复数据删除的快速增量压缩方法
4. Combining Deduplication and Delta Compression to Achieve Low-Overhead Data Reduction on Backup Datasets [C] . Wen Xia, Hong Jiang, Dan Feng, Data Compression Conference . 2014

机译：结合重复数据删除和三角形压缩，实现备份数据集的低开销数据减少
5. Collocated Data Deduplication for Virtual Machine Backup in the Cloud. [D] . Zhang, Wei. 2014

机译：用于云中虚拟机备份的并置重复数据删除。
6. DOMe: A deduplication optimization method for the NewSQL database backups [O] . Longxiang Wang, Zhengdong Zhu, Xingjun Zhang, -1

机译：DOMe：NewSQL数据库备份的重复数据删除优化方法
7. WAN optimized replication of backup datasets using stream-informed delta compression [O] . Philip Shilane, Mark Huang, Grant Wallace, 2012

机译：WAN使用流通知的增量压缩优化备份数据集的复制

Combining Deduplication and Delta Compression to Achieve Low-Overhead Data Reduction on Backup Datasets

摘要

著录项

相似文献

相关主题

期刊订阅