首页> 外文会议>2010 IEEE Fifth International Conference on Networking, Architecture and Storage >A High Effective Indexing and Retrieval Method Providing Block-Level Timely Recovery to Any Point-in-Time
【24h】

A High Effective Indexing and Retrieval Method Providing Block-Level Timely Recovery to Any Point-in-Time

机译:一种高效的索引和检索方法,可将块级及时恢复到任何时间点

获取原文

摘要

Block-level continuous data protection (CDP) logs every disk write operation so that the disk can be rolled back to any arbitrary point-in-time within a time window. For each update operation is time stamped and logged, the indexing for such huge amounts of records is an important and challenging problem. Unfortunately, the conventional indexing methods can not efficiently record large numbers of versions and support instant ȁC;time-travelȁD; types of queries in CDP. In this paper, we present an effective indexing method providing timely recovery to any point-in-time in comprehensive versioning systems, called the Hierarchical Spatial-Temporal Indexing Method (HSTIM). The basic principle of HSTIM is to partition the time domain and the production storage LBAs into time slice and segments respectively according to update frequency of disk IOs, and build separate index file for each segment. In order to meet the demands of instant view of history data, the metadata of production storage is independently indexed. For long-time history data retrieval requirements, index snapshot is introduced in HSTIM to reduce the retrieval time. Another distinctive feature of HSTIM is its incremental retrieval method, which achieves high query performance at time point t + t if neighboring time point t is queried previously. The paper compares HSTIM with traditional B+-tree and multi-version B-tree (MVBT) index in many aspects. Experiments with real workload IO trace files show that HSTIM can locate history data within 8.05 seconds for recovery point of 48 hours, while B+-tree consumes 24.04 seconds. If the index snapshot is applied, HSTIM can reduce such retrieval time within 3 seconds.
机译:块级连续数据保护(CDP)记录每个磁盘写操作,以便可以在时间窗口内将磁盘回滚到任意时间点。对于每个更新操作都带有时间戳和日志记录,为如此大量的记录建立索引是一个重要且具有挑战性的问题。不幸的是,传统的索引方法不能有效地记录大量版本并不能支持即时ȁC;time-travelȁD;。 CDP中的查询类型。在本文中,我们提出了一种有效的索引方法,该方法可在综合版本控制系统中提供及时恢复到任何时间点的功能,这种方法称为层次空间-时间索引方法(HSTIM)。 HSTIM的基本原理是根据磁盘IO的更新频率将时域和生产存储LBA分别划分为时间片和段,并为每个段构建单独的索引文件。为了满足即时查看历史数据的需求,对生产存储的元数据进行了独立索引。对于长期历史数据检索要求,在HSTIM中引入了索引快照以减少检索时间。 HSTIM的另一个显着特征是其增量检索方法,如果先前查询了相邻时间点t,则该方法在时间点t + t处可获得较高的查询性能。本文在许多方面将HSTIM与传统的B +树和多版本B树(MVBT)索引进行了比较。使用实际工作负载IO跟踪文件进行的实验表明,HSTIM可以在8.05秒内找到历史数据,恢复点为48小时,而B +树则为24.04秒。如果应用了索引快照,则HSTIM可以在3秒内减少这种检索时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号