首页> 外国专利> Algorithm for dividing a sequence of values into chunks using breakpoints

Algorithm for dividing a sequence of values into chunks using breakpoints

机译:使用断点将值序列划分为块的算法

摘要

A method of dividing a sequence of data into chunks using a sliding window uses an algorithm to compare fingerprint values for each position within the sequence against sets of criteria to create breakpoints. When a fingerprint value does not satisfy a first set of criteria a second set are applied and if satisfied a potential breakpoint is identified. Subsequently, if a fingerprint value that satisfies the first set of criteria is not found before the maximum chunk size is reached the potential breakpoint is designated as a breakpoint. Additional constraints on minimum and maximum sizes of chunks can be used to further refine the method. Further sets of criteria may be used if a fingerprint value does not meet wither of the two initial criteria sets. Preferably the fingerprint value can be identified using Rabin's Fingerprint algorithm.
机译:一种使用滑动窗口将数据序列划分为大块的方法,该算法使用一种算法将序列中每个位置的指纹值与标准集进行比较,以创建断点。当指纹值不满足第一组标准时,应用第二组,并且如果满足,则识别潜在的断点。随后,如果在达到最大块大小之前未找到满足第一组标准的指纹值,则将潜在断点指定为断点。对块的最小和最大大小的附加约束可用于进一步完善该方法。如果指纹值不满足两个初始标准集的要求,则可以使用其他标准集。优选地,可以使用拉宾的指纹算法来识别指纹值。

著录项

  • 公开/公告号GB2450025A

    专利类型

  • 公开/公告日2008-12-10

    原文格式PDF

  • 申请/专利权人 HEWLETT-PACKARD DEVELOPMENT COMPANY L.P.;

    申请/专利号GB20080015775

  • 发明设计人 KAVE ESHGHI;HSIU-KHUERN TANG;

    申请日2005-06-07

  • 分类号G06F17/30;G06F7/72;G06F11/10;G06F12/00;

  • 国家 GB

  • 入库时间 2022-08-21 19:06:42

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号