首页> 外国专利> System and method for dividing data into predominantly fixed-sized chunks so that duplicate data chunks may be identified

System and method for dividing data into predominantly fixed-sized chunks so that duplicate data chunks may be identified

机译:用于将数据划分为主要固定大小的块以便可以识别重复的数据块的系统和方法

摘要

A data chunking system divides data into predominantly fixed-sized chunks such that duplicate data may be identified. The data chunking system may be used to reduce the data storage and save network bandwidth by allowing storage or transmission of primarily unique data chunks. The system may also be used to increase reliability in data storage and network transmission, by allowing an error affecting a data chunk to be repaired with an identified duplicate chunk. The data chunking system chunks data by selecting a chunk of fixed size, then moving a window along the data until a match to existing data is found. As the window moves across the data, unique chunks predominantly of fixed size are formed in the data passed over. Several embodiments provide alternate methods of determining whether a selected chunk matches existing data and methods by which the window is moved through the data. To locate duplicate data, the data chunking system remembers data by computing a mathematical function of a data chunk and inserting the computed value into a hash table.
机译:数据分块系统将数据划分为固定大小的块,以便可以识别重复数据。数据组块系统可用于通过允许存储或传输主要唯一的数据块来减少数据存储并节省网络带宽。该系统还可用于通过允许影响数据块的错误用已标识的重复块来修复来提高数据存储和网络传输的可靠性。数据分块系统通过选择固定大小的块,然后沿数据移动窗口,直到找到与现有数据的匹配,来对数据进行分块。当窗口在数据上移动时,在传递的数据中会形成主要具有固定大小的唯一块。几个实施例提供了确定所选块是否与现有数据匹配的替代方法,以及通过其在数据中移动窗口的方法。为了定位重复数据,数据分块系统通过计算数据块的数学函数并将计算出的值插入哈希表中来记住数据。

著录项

  • 公开/公告号US7281006B2

    专利类型

  • 公开/公告日2007-10-09

    原文格式PDF

  • 申请/专利权人 WINDSOR WEE SUN HSU;SHAUCHI ONG;

    申请/专利号US20030693284

  • 发明设计人 WINDSOR WEE SUN HSU;SHAUCHI ONG;

    申请日2003-10-23

  • 分类号G06F17/30;G06F11/00;

  • 国家 US

  • 入库时间 2022-08-21 21:00:55

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号