Journal: ACM Transactions on Database Systems
A Join-Like Operator to Combine Data Cubes and Answer Queries from Multiple Data Cubes

Abstract

To answer a "joint" query from multiple data cubes, Pourabbas and Shoshani [2007] distinguish the data cube on the measure of interest (called the "primary" data cube) from the other data cubes (called "proxy" data cubes), which are used to bring in the query dimensions that are absent from the primary data cube. They demonstrate in case studies that, if the measures of the primary and proxy data cubes are correlated, then the answer to a joint query is an accurate estimate of its true value. With two or more proxy data cubes, the result depends on the way the primary and proxy data cubes are combined; however, for certain combination schemes Pourabbas and Shoshani provide a sufficient condition, which they call proxy noncommonality, for the invariance of the result. In this article, we introduce: (1) a merge operator combining the contents of a primary data cube with the contents of a proxy data cube, (2) merge expressions for general combination schemes, and (3) an equivalence relation between merge expressions having the same pattern. Then, we prove that proxy noncommonality characterizes patterns for which every two merge expressions are equivalent. Moreover, we provide an efficient procedure for answering joint queries in the special case of perfect merge expressions. Finally, we show that our results apply to data cubes in which measures are obtained from unaggregated data using the aggregate functions SUM, COUNT, MAX, and MIN, as well as many other aggregate functions.
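The abstract does not spell out how the merge operator is defined, so the following is only an illustrative sketch under a stated assumption: that a primary cell is spread over the proxy-only dimensions in proportion to the proxy measure within each combination of shared dimension values. The function name `merge`, the dictionary representation of a cube, and the sample data are all hypothetical and are not taken from the article.

```python
from collections import defaultdict

def merge(primary, p_dims, proxy, q_dims):
    """Combine a primary cube with one proxy cube (assumed proportional scheme).

    Each cube is a dict mapping a tuple of dimension values to a measure;
    p_dims and q_dims name the dimensions in tuple order.
    """
    common = [d for d in p_dims if d in q_dims]      # shared dimensions
    extra = [d for d in q_dims if d not in p_dims]   # dimensions only in the proxy

    # Proxy totals for each combination of shared-dimension values.
    totals = defaultdict(float)
    for q_key, q_val in proxy.items():
        q_row = dict(zip(q_dims, q_key))
        totals[tuple(q_row[d] for d in common)] += q_val

    merged = {}
    for p_key, p_val in primary.items():
        p_row = dict(zip(p_dims, p_key))
        c_key = tuple(p_row[d] for d in common)
        if totals[c_key] == 0:
            continue  # no proxy information for this slice
        for q_key, q_val in proxy.items():
            q_row = dict(zip(q_dims, q_key))
            if tuple(q_row[d] for d in common) != c_key:
                continue
            out_key = p_key + tuple(q_row[d] for d in extra)
            # Share of the primary cell allotted to this proxy cell.
            merged[out_key] = p_val * q_val / totals[c_key]
    return merged, p_dims + extra

# Hypothetical data: a primary measure by (state, age), a proxy by (state, sex).
primary = {("CA", "young"): 100.0, ("CA", "old"): 60.0}
proxy = {("CA", "M"): 30.0, ("CA", "F"): 10.0}
merged, dims = merge(primary, ["state", "age"], proxy, ["state", "sex"])
```

Under this assumed scheme the merged cube has the dimensions `state`, `age`, `sex`, and summing it back over `sex` reproduces the primary cube. With two or more proxy cubes, the function would be applied repeatedly, which is where the order of combination discussed in the abstract becomes relevant.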