...
首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >PDFS: Partially Dedupped File System for Primary Workloads
【24h】

PDFS: Partially Dedupped File System for Primary Workloads

机译:PDFS:用于主要工作负载的部分重复文件系统

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Primary storage dedup is difficult to be accomplished because of challenges to achieve low IO latency and high throughput while eliminating data redundancy effectively in the critical IO Path. In this paper, we design and implement the PDFS, a partially dedupped file system for primary workloads, which is built on a generalized framework using partial data lookup for efficient searching of redundant data in quickly chosen data subsets instead of the whole data. PDFS improves IO latency and throughput systematically by techniques including write path optimization, data dedup parallelization and write order preserving. Such design choices bring dedup to the masses for general primary workloads. Experimental results show that PDFS achieves 74-99 percent of the theoretical maximum dedup ratio with very small or even negative performance degradations compared with main stream file systems without dedup support. Discussions about varied configuring experiences of PDFS are also carried out.
机译:由于要实现低IO延迟和高吞吐量同时又要在关键IO路径中有效消除数据冗余的挑战,很难实现主存储的重复数据删除。在本文中,我们设计和实现了PDFS,这是一种用于主要工作负载的部分去重复的文件系统,该文件系统建立在使用部分数据查找的通用框架上,可以高效地搜索快速选择的数据子集中的冗余数据,而不是整个数据。 PDFS通过包括写入路径优化,数据重复数据删除并行化和写入顺序保留在内的技术来系统地改善IO延迟和吞吐量。这样的设计选择使普通的主要工作负载不堪重负。实验结果表明,与不支持dedup的主流文件系统相比,PDFS达到了理论最大dedup比率的74-99%,并且性能下降非常小,甚至是负面的。还对PDFS的各种配置体验进行了讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号