【24h】

An Undirected Graph Traversal Based Grouping Prediction Method for Data De-duplication

机译:一种基于无向图遍历的重复数据分组预测方法

获取原文
获取原文并翻译 | 示例

摘要

The data capacity of the de-duplication system, which is limited by the memory, is difficult to carry out a large-scale expansion. To solve this problem, the paper proposes a hash table grouping prediction method based on undirected graph traversal. This method exploits the indexing table replacement, which is similar to the virtual memory cache replacement, to expand the storage capacity of the data de-duplication system without increasing the system memory. The hit rate of the grouping prediction and system performance are improved by grouping index entries based on undirected graph traversal. Experimental results show that, based on the cache prefetching and the hash table grouping, the memory consuming takes up 10% of the index table size while the capacity equally rise to 10 times of original. The method can make the index table cache hit rate increased to 87.6%, comparing 47% without group in dataset 1 of our experiment, make the performance acceptable.
机译:受存储器限制的重复数据删除系统的数据容量难以大规模扩展。针对这一问题,提出了一种基于无向图遍历的哈希表分组预测方法。此方法利用索引表替换(类似于虚拟内存缓存替换)来扩展重复数据删除系统的存储容量,而无需增加系统内存。通过基于无向图遍历对索引条目进行分组,可以提高分组预测的命中率和系统性能。实验结果表明,基于高速缓存的预取和哈希表分组,内存消耗占索引表大小的10%,而容量则相等于原始表的10倍。该方法可使索引表的缓存命中率提高到87.6%,与本实验数据集1中无组的47%相比,性能令人满意。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号