
Design and implementation of various file deduplication schemes on storage devices


Abstract

As smart devices proliferate, people generate large amounts of data in their daily lives and store it in local or remote file systems. Although modern computer hardware and network technologies can keep up with this volume of data, effective file deduplication can still save storage space in both private computing environments and public cloud systems. In this paper, we design and implement several file deduplication schemes on storage devices, each based on a different duplicate-checking rule: file name, file size, and the hash value of the full or partial file content. Comprehensive experimental results show that file deduplication based on partial content hashing achieves a better trade-off between computation cost and deduplication accuracy.
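The abstract does not spell out how the checking rules are implemented, so the following is only a minimal sketch of how size-plus-hash duplicate checking might look. The function names, the 4096-byte prefix, and the choice of SHA-256 are all assumptions made for illustration, not details taken from the paper.

```python
import hashlib
import os
from collections import defaultdict


def partial_hash(path, num_bytes=4096):
    """Hash only the first num_bytes of a file (an assumed partial-content rule)."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        h.update(f.read(num_bytes))
    return h.hexdigest()


def full_hash(path, chunk_size=65536):
    """Hash the entire file, reading it in chunks to bound memory use."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root, hash_func):
    """Group files under root by (size, hash) and return groups with 2+ files."""
    groups = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            key = (os.path.getsize(path), hash_func(path))
            groups[key].append(path)
    return [sorted(paths) for paths in groups.values() if len(paths) > 1]
```

Calling `find_duplicates(root, partial_hash)` reads at most a fixed prefix of each file, so it is cheaper than `find_duplicates(root, full_hash)`, but it can report files as duplicates when they share a size and prefix yet differ later on. That is one way to picture the cost versus accuracy trade-off the abstract refers to.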
