An interface to grid file method for datacube storage and range sum query.

Abstract

This work proposes an interface to the grid file for storing multi-dimensional datacubes and answering range-sum queries in Online Analytical Processing (OLAP). The grid file has a natural appeal for datacube storage because it conforms to the grid structure of the datacube. The proposed approach adapts to insertions of new data by splitting cells dynamically. It minimizes the drawback of traditional grid files in dealing with unevenly distributed data while retaining their well-known advantages of easy implementation and maintenance. The experimental results show that the proposed method performs much better than datacubes implemented with R*-trees in file size, construction time, and query response time, for both evenly and unevenly distributed data.

In this study, we choose the grid file to store the multi-dimensional data cube. The grid file conforms to the data cube naturally. It is simple, does not require a complex index structure, and allows direct access to the desired blocks. The data space is partitioned by a grid, and each grid cell, which contains a number of data points, is stored as a disk block. When a disk block overflows, the grid cell splits as necessary. The splitting is dynamic and hierarchical. A grid directory is kept in memory; each of its elements stores the aggregated value of the measure attributes of the corresponding grid cell. Consequently, a cell on disk needs to be accessed only when it partially overlaps the range-sum query. We have performed extensive experiments on the grid file approach. The results show that our grid file structure is not only simple but also performs well in disk accesses and CPU time.
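The role of the in-memory directory in range-sum queries can be illustrated with a minimal Python sketch. It is not the thesis implementation: it assumes a two-dimensional cube with integer coordinates, uses fixed-size cells (the dynamic, hierarchical splitting is omitted), and simulates disk blocks with an in-memory dictionary. The names `GridFileSketch`, `cell_size`, and `block_reads` are hypothetical and chosen only for this illustration.

```python
from collections import defaultdict


class GridFileSketch:
    """Toy 2-D grid with fixed-size cells (assumption: no dynamic splitting)."""

    def __init__(self, cell_size):
        self.cell_size = cell_size
        # In-memory directory: cell -> aggregated measure of all points in the cell.
        self.directory = defaultdict(int)
        # Stand-in for disk blocks: cell -> list of (x, y, value) records.
        self.blocks = defaultdict(list)
        # Counts how many simulated disk blocks a query had to read.
        self.block_reads = 0

    def insert(self, x, y, value):
        cell = (x // self.cell_size, y // self.cell_size)
        self.blocks[cell].append((x, y, value))
        self.directory[cell] += value

    def range_sum(self, x_lo, x_hi, y_lo, y_hi):
        """Sum of values with x_lo <= x <= x_hi and y_lo <= y <= y_hi."""
        s = self.cell_size
        total = 0
        for cx in range(x_lo // s, x_hi // s + 1):
            for cy in range(y_lo // s, y_hi // s + 1):
                cell = (cx, cy)
                if cell not in self.directory:
                    continue  # empty cell, nothing stored
                # The cell spans [cx*s, cx*s + s - 1] x [cy*s, cy*s + s - 1].
                fully_inside = (
                    x_lo <= cx * s and cx * s + s - 1 <= x_hi
                    and y_lo <= cy * s and cy * s + s - 1 <= y_hi
                )
                if fully_inside:
                    # Answered from the directory alone, no block access.
                    total += self.directory[cell]
                else:
                    # Partial overlap: read the block and filter its records.
                    self.block_reads += 1
                    total += sum(
                        v for x, y, v in self.blocks[cell]
                        if x_lo <= x <= x_hi and y_lo <= y <= y_hi
                    )
        return total
```

In this sketch, a query whose boundaries align with cell boundaries is answered entirely from the directory, and only cells cut by the query boundary trigger a simulated block read.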

Bibliographic record

  • Author

    Mohammed, Imaduddin F.

  • Author affiliation

    Southern Illinois University at Carbondale, Computer Science

  • Degree-granting institution: Southern Illinois University at Carbondale, Computer Science
  • Discipline: Computer Science
  • Degree: M.S.
  • Year: 2007
  • Pages: 38 p.
  • Total pages: 38
  • Original format: PDF
  • Language: eng