首页> 外文期刊>Information and software technology >Multiversion Join Index For Multiversion Data Warehouse
【24h】

Multiversion Join Index For Multiversion Data Warehouse

机译:多版本数据仓库的多版本联接索引

获取原文
获取原文并翻译 | 示例
           

摘要

The data warehouse (DW) technology is developed in order to support the integration of external data sources (EDSs) for the purpose of advanced data analysis by On-Line Analytical Processing (OLAP) applications. Since contents and structures of integrated EDSs may evolve in time, the content and schema of a DW must evolve too in order to correctly reflect the evolution of EDSs. In order to manage a DW evolution, we developed the multiversion data warehouse (MVDW) approach. In this approach, different states of a DW are represented by the sequence of persistent DW versions that correspond either to the real world state or to a simulation scenario. Typically, OLAP applications execute star queries that join multiple fact and dimension tables. An important optimization technique for this kind of queries is based on join indexes. Since in the MVDW fact and dimension data are physically distributed among multiple DW versions, standard join indexes need extensions. In this paper we present the concept of a multiversion join index (MVJI) applicable to indexing dimension and fact tables in the MVDW. The MVJI has a two-level structure, where an upper level is used for indexing attributes and a lower level is used for indexing DW versions. The paper also presents the theoretical upper bound (pessimistic) analysis of the MVJI performance characteristic with respect to I/O operations. The analysis is followed by experimental evaluation. It shows that the MVJI increases a system performance for queries addressing multiple DW versions with exact match and range predicates.
机译:开发数据仓库(DW)技术是为了支持外部数据源(EDS)的集成,以便通过在线分析处理(OLAP)应用程序进行高级数据分析。由于集成EDS的内容和结构可能会随时间变化,因此DW的内容和架构也必须变化,以便正确反映EDS的变化。为了管理DW演进,我们开发了多版本数据仓库(MVDW)方法。在这种方法中,DW的不同状态由对应于现实状态或模拟场景的持久DW版本的序列表示。通常,OLAP应用程序执行将多个事实和维度表联接在一起的星形查询。这种查询的一种重要的优化技术是基于联接索引。由于在MVDW中,事实和维度数据在物理上分布在多个DW版本之间,因此标准联接索引需要扩展。在本文中,我们提出了适用于MVDW中的索引维和事实表的多版本联接索引(MVJI)的概念。 MVJI具有两级结构,其中上层用于索引属性,下层用于索引DW版本。本文还提出了关于I / O操作的MVJI性能特征的理论上限(悲观)分析。分析之后进行实验评估。它显示了MVJI提高了针对具有精确匹配和范围谓词的多个DW版本的查询的系统性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号