首页> 外文会议>IEEE Symposium on Computers and Communication >NF-Dedupe: A Novel No-fingerprint Deduplication Scheme for Flash-based SSDs
【24h】

NF-Dedupe: A Novel No-fingerprint Deduplication Scheme for Flash-based SSDs

机译:NF-DEDUPE:基于闪存的SSD的一种新型无指纹重复数据删除方案

获取原文

摘要

NAND flash-based Solid State Drives (SSDs) have been widely deployed in data centers of cloud computing due to their high performance compared with hard disks, while the limited lifespan of flash memory makes SSDs not very suitable for write-intensive applications. Deduplication is an effective method used to reduce the write traffic of applications thus can be used to extend the lifespan of SSDs. However, traditional deduplication schemes rely on the time-consuming fingerprint computing process to find duplicated data, which may impair the write performance of SSDs. Accordingly, Pre-hashing was proposed to reduce the chances of fingerprint computing thus improving the performance of SSDs with deduplication, but at the cost of degrading deduplication rate. In this paper, we propose NF-Dedupe, a new deduplication scheme that needs no fingerprint computing for flash-based SSDs. NF-Dedupe determines whether a write page is duplicated or not by comparing the write page with its potential duplicated page read from underlying flash chips byte by byte, rather than relying on the comparison of fingerprints. As flash memory is known for its high parallelism and low read latency, reading a page from flash chip and comparing two pages byte by byte introduce lower overhead than the fingerprint computing does. We evaluate the NF-Dedupe via trace-driven simulations. Experimental results have shown that NF-Dedupe outperforms the other approaches and can achieve the deduplication rate ranging from 5.3% to 29.9% and the write latency is improved by a factor of up to 21% with an average of 12%.
机译:NAND基于闪存的固态驱动器(SSD)已广泛部署在云计算的数据中心,由于它们的高性能与硬盘相比,闪存的有限寿命使SSD不适合写入密集型应用。重复数据删除是一种有效的方法,用于减少应用程序的写入流量,从而可以用于扩展SSD的寿命。但是,传统的重复数据删除方案依赖于耗时的指纹计算过程来查找重复数据,这可能会损害SSD的写性能。因此,提出了预先散列以减少指纹计算的机会,从而提高了SSD的性能与重复数据删除,但以降级的重复数据删除率降低。在本文中,我们提出了NF-DEDUPE,一种新的重复数据删除方案,不需要针对基于闪存的SSD的指纹计算。 NF-DEDUPE通过将写入页与其潜在的复制页面进行比较,确定写页是否复制,而不是从基础闪存码头字节读取字节,而不是依赖于指纹的比较。由于闪存以其高并行性和低读取延迟而闻名,请从Flash芯片读取页面并将两个页面字节与字节相比,从字节引入较低的开销,而不是指纹计算。我们通过跟踪驱动模拟评估NF-DEDUPE。实验结果表明,NF-DEDUPE优于其他方法,可以达到5.3%至29.9%的重复性递抄率,并且写入延迟提高了高达21%,平均为12%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号