An aggregation algorithm using a multidimensional file in multidimensional OLAP

Lee YK.; Whang KY.; Moon YS.; Song Y.

首页> 外文期刊>Information Sciences: An International Journal >An aggregation algorithm using a multidimensional file in multidimensional OLAP

【24h】

An aggregation algorithm using a multidimensional file in multidimensional OLAP

机译：在多维OLAP中使用多维文件的聚合算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Aggregation is an operation that plays a key role in multidimensional OLAP (MOLAP). Existing aggregation methods in MOLAP have been proposed for file structures such as multidimensional arrays. These file structures are suitable for data with uniform distributions, but do not work well with skewed distributions. In this paper, we consider an aggregation method that uses dynamic multidimensional files adapting to skewed distributions. In these multidimensional files, the sizes of page regions vary according to the data density in these regions, and the pages that belong to a larger region are accessed multiple times while computing aggregations. To solve this problem, we first present an aggregation computation model that uses the new notions of disjoint-inclusive partition and induced space filling curves. Based on this model, we then present a dynamic aggregation algorithm. Using these notions, the algorithm allows us to maximize the effectiveness of the buffer-we control the page access order in such a way that a page being accessed can reside in the buffer until the next access. We have conducted experiments to show the effectiveness of our approach. Experimental results for a real data set show that the algorithm reduces the number of disk accesses by up to 5.09 times compared with a naive algorithm. The results further show that the algorithm achieves a near optimal performance (i.e., normalized I/O = 1.01) with the total main memory (needed for the buffer and the result table) less than 1.0% of the database size. We believe our work also provides an excellent formal basis for investigating further issues in computing aggregations in MOLAR (C) 2003 Published by Elsevier Science Inc. [References: 19]

机译：聚合是在多维OLAP（MOLAP）中起关键作用的操作。已经提出了MOLAP中用于诸如多维数组的文件结构的现有聚合方法。这些文件结构适用于具有均匀分布的数据，但不适用于偏斜的分布。在本文中，我们考虑一种聚合方法，该方法使用适合于偏斜分布的动态多维文件。在这些多维文件中，页面区域的大小根据这些区域中的数据密度而变化，并且在计算聚合时多次访问属于较大区域的页面。为了解决这个问题，我们首先提出一个聚集计算模型，该模型使用不相交包含分区和诱导空间填充曲线的新概念。然后，基于此模型，我们提出了一种动态聚合算法。使用这些概念，该算法使我们能够最大程度地提高缓冲区的有效性-我们控制页面访问顺序，以使被访问的页面可以驻留在缓冲区中，直到下一次访问为止。我们进行了实验以证明我们方法的有效性。真实数据集的实验结果表明，与朴素算法相比，该算法最多可将磁盘访问次数减少5.09倍。结果还表明，该算法在总主存储器（缓冲区和结果表所需）小于数据库大小的1.0％的情况下，实现了接近最佳的性能（即，标准化I / O = 1.01）。我们相信，我们的工作也为研究进一步聚合计算中的问题（在MOLAR（C）2003中，由Elsevier Science Inc.出版）提供了良好的正式基础。[参考文献：19]

著录项

来源
《Information Sciences: An International Journal》 |2003年第0期|共18页
作者
Lee YK.; Whang KY.; Moon YS.; Song Y.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. An aggregation algorithm using a multidimensional file in multidimensional OLAP [J] . Lee YK., Whang KY., Moon YS., Information Sciences: An International Journal . 2003,第0期

机译：在多维OLAP中使用多维文件的聚合算法
2. Reducing the Multidimensionality of OLAP Cubes with Genetic Algorithms and Multiple Correspondence Analysis [J] . Semeh Ben Salem, Sami Naouali Procedia Computer Science . 2015,第1期

机译：用遗传算法和多重对应分析降低OLAP多维数据集的多维性
3. Algorithms for multidimensional partitioning of static files [J] . Rotem D., Segev A. IEEE Transactions on Software Engineering . 1988,第11期

机译：静态文件的多维分区算法
4. A One-Pass Aggregation Algorithm with the Optimal Buffer Size in Multidimensional OLAP [C] . Young-Koo Lee, Kyu-Young Whang, Yang-Sae Moon, Twenty-eighth International Conference on Very Large Data Bases, Aug 20-23, 2002, Hong Kong SAR, China . 2002

机译：多维OLAP中具有最佳缓冲区大小的单程聚合算法
5. Efficient simple path and Twig query processing algorithms for XML data organized as multidimensional file. [D] . Ali Musleh, Dhiaa Abdulrab. 2010

机译：针对组织为多维文件的XML数据的高效简单路径和Twig查询处理算法。
6. Comparing Two Algorithms for Calibrating the RestrictedNon-Compensatory Multidimensional IRT Model [O] . Chun Wang, Steven W. Nydick 2015

机译：比较两种校准受限算法非补偿多维IRT模型
7. An aggregation algorithm using a multidimensional file in multidimensional OLAP [O] . Young-koo Lee A, Kyu-young Whang A, Yang-sae Moon A, 2001

机译：在多维OLap中使用多维文件的聚合算法

An aggregation algorithm using a multidimensional file in multidimensional OLAP

摘要

著录项

相似文献

相关主题

期刊订阅