首页>
外国专利>
Dictionary architecture and methodology for revision-tolerant data de-duplication
Dictionary architecture and methodology for revision-tolerant data de-duplication
展开▼
机译:允许修订的重复数据删除的字典体系结构和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
Redundant data is removed from a volume of data by partitioning the volume of data into fixed-length input segments and, for each of the input segments, traversing nodes of a search tree in accordance with the value of a fixed-size portion of the input segment to determine if the search tree contains a pointer to a matching fixed-sized portion of a segment within a dictionary. If the search tree contains the pointer, the input segment is compared with the segment within the dictionary pointed to by the pointer, and a token representative of the segment within the dictionary is substituted for at least part of the input segment determined to match the segment within the dictionary.
展开▼