首页>
外国专利>
Processing device utilizing content-based signature prefix for efficient generation of deduplication estimate
Processing device utilizing content-based signature prefix for efficient generation of deduplication estimate
展开▼
机译:利用基于内容的签名前缀的处理设备,以便高效生成重复数据删除估计
展开▼
页面导航
摘要
著录项
相似文献
摘要
An apparatus in one embodiment comprises at least one processing device comprising a processor coupled to a memory. The processing device is configured to identify a dataset to be scanned to generate a deduplication estimate for that dataset, to designate a content-based signature prefix to be utilized in the scan, and for each of a plurality of pages of the dataset, to scan the page, where scanning the page includes computing a content-based signature for the page, comparing an initial portion of the content-based signature to the designated content-based signature prefix, and responsive to a match between the initial portion and the designated content-based signature prefix, updating a corresponding entry of a deduplication estimate table for the dataset. The processing device generates the deduplication estimate for the dataset based at least in part on contents of the deduplication estimate table.
展开▼