Using datacube aggregates for approximate querying and deviation detection

Palpanas T.; Koudas N.; Mendelzon A.

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Using datacube aggregates for approximate querying and deviation detection

【24h】

Using datacube aggregates for approximate querying and deviation detection

机译：使用数据立方体聚合进行近似查询和偏差检测

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Much research has been devoted to the efficient computation of relational aggregations and, specifically, the efficient execution of the datacube operation. In this paper, we consider the inverse problem, that of deriving (approximately) the original data from the aggregates. We motivate this problem in the context of two specific application areas, approximate query answering and data analysis. We propose a framework based on the notion of information entropy that enables us to estimate the original values in a data set, given only aggregated information about it. We then show how approximate queries on the data from which the aggregates were derived can be performed using our framework. We also describe an alternate use of the proposed framework that enables us to identify values that deviate from the underlying data distribution, suitable for data mining purposes. We present a detailed performance study of the algorithms using both real and synthetic data, highlighting the benefits of our approach as well as the efficiency of the proposed solutions. Finally, we evaluate our techniques with a case study on a real data set, which illustrates the applicability of our approach.

机译：许多研究致力于关系聚合的有效计算，尤其是数据多维数据集操作的有效执行。在本文中，我们考虑了反问题，即从集合中派生（近似）原始数据的问题。我们在两个特定的应用领域（近似查询回答和数据分析）中激发了这个问题。我们提出了一个基于信息熵概念的框架，该框架使我们能够估计数据集中的原始值（仅给出有关它的汇总信息）。然后，我们展示了如何使用我们的框架对汇总数据的近似查询。我们还描述了所建议框架的替代用法，该框架使我们能够识别与基础数据分布不同的值，这些值适用于数据挖掘目的。我们使用真实数据和合成数据对算法进行了详细的性能研究，突出了我们方法的优势以及所提出解决方案的效率。最后，我们通过对真实数据集进行案例研究来评估我们的技术，这说明了我们方法的适用性。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2005年第11期|p.1465-1477|共13页
作者
Palpanas T.; Koudas N.; Mendelzon A.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
data analysis; data mining; data warehouses; maximum entropy methods; query processing; approximate query answering; data analysis; data distribution; data mining; data warehouse; datacube aggregate; deviation detection; information entropy; inverse problem; Index Ter;

机译：数据分析;数据挖掘;数据仓库;最大熵方法;查询处理;近似查询回答;数据分析;数据分布;数据挖掘;数据仓库;数据立方体集合;偏差检测;信息熵;逆问题;索引;

相似文献

外文文献
中文文献
专利

1. Approximate search algorithm for aggregate k-nearest neighbour queries on remote spatial databases [J] . Hideki Sato, Ryoichi Narita International journal of knowledge and web intelligence . 2013,第1期

机译：远程空间数据库上集合k最近邻查询的近似搜索算法
2. Processing approximate aggregate queries in wireless sensor networks [J] . Antonios Deligiannakis, Yannis Kotidis, Nick Roussopoulos Information Systems . 2006,第8期

机译：在无线传感器网络中处理近似聚合查询
3. C1q deviation test for the detection of immune complexes, aggregates of IgG, and bacterial products in human serum. [J] . A T Sobel, V A Bokisch, H J Müller-Eberhard The Journal of Experomental Medicine . 1975,第1期

机译：C1q偏差测试，用于检测人血清中的免疫复合物，IgG聚集体和细菌产物。
4. Entropy based approximate querying and exploration of datacubes [C] . Palpanas, T., Koudas, . 2001

机译：基于熵的数据立方体近似查询与探索
5. Approximate answering of aggregate queries in relational databases. [D] . Jermaine, Christopher Matthew. 2002

机译：关系数据库中聚合查询的近似答案。
6. C1q deviation test for the detection of immune complexes aggregates of IgG and bacterial products in human serum [O] . 1975

机译：C1q偏差测试用于检测人血清中的免疫复合物IgG聚集体和细菌产物
7. Using Datacube Aggregates for Approximate Querying and Deviation Detection [O] . Themis Palpanas, Nick Koudas, Ieee Computer Society, 2005

机译：使用Datacube aggregates进行近似查询和偏差检测
8. Approximate Equations for Evaluating the Impact Dispersion Resulting from Reentry Winds and Deviations in Density [R] . Glover, L. S. 1970

机译：用于评估再入风和密度偏差引起的冲击扩散的近似方程

Using datacube aggregates for approximate querying and deviation detection

摘要

著录项

相似文献

相关主题

期刊订阅