Conference: ACM SIGMOD International Conference on Management of Data
Threshold Query Optimization for Uncertain Data

Abstract

The probabilistic threshold query (PTQ) is one of the most common queries in uncertain databases, where all results satisfying the query with probabilities that meet the threshold requirement are returned. PTQ is used widely in nearest-neighbor queries, range queries, ranking queries, etc. In this paper, we investigate the general PTQ for arbitrary SQL queries that involve selections, projections and joins. The uncertain database model that we use is one that combines both attribute and tuple uncertainty as well as correlations between arbitrary attribute sets. We address the PTQ optimization problem that aims at improving the efficiency of PTQ query execution by enabling alternative query plan enumeration for optimization. We propose general optimization rules as well as rules specifically for selections, projections and joins. We introduce a threshold operator (τ-operator) to the query plan and show it is generally desirable to push down the τ-operator as much as possible. Our PTQ optimizations are evaluated in a real uncertain database management system. Our experiments on both real and synthetic data sets show that the optimizations improve the PTQ query processing time.
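The idea of pushing the τ-operator down a query plan can be illustrated with a minimal sketch. It assumes a much simpler model than the one described above: tuple-level uncertainty only, mutually independent tuples, and a plain equi-join with no projection. Under those assumptions a joined tuple's probability is the product of its inputs' probabilities, so it can never exceed either input, and input tuples below the threshold can be discarded before the join. The relation layout, the function names (`threshold`, `join`, `ptq_naive`, `ptq_pushed`) and the sample data are hypothetical and are not taken from the paper.

```python
from itertools import product

# Hypothetical toy model (an assumption, not the paper's model):
# a relation is a list of (key, value, probability) tuples, and
# all tuples are assumed to be mutually independent.

def threshold(rel, tau):
    """tau-operator: keep tuples whose probability (last field) is >= tau."""
    return [t for t in rel if t[-1] >= tau]

def join(left, right):
    """Equi-join on the key; probabilities multiply under independence."""
    return [
        (lk, lv, rv, lp * rp)
        for (lk, lv, lp), (rk, rv, rp) in product(left, right)
        if lk == rk
    ]

def ptq_naive(left, right, tau):
    """Apply the tau-operator only at the top of the plan."""
    return threshold(join(left, right), tau)

def ptq_pushed(left, right, tau):
    """Push the tau-operator below the join.

    Because lp * rp <= min(lp, rp), an input tuple with probability
    below tau cannot contribute to any output with probability >= tau.
    The top-level threshold is still required: two inputs that each
    pass tau may have a product that falls below it.
    """
    return threshold(join(threshold(left, tau), threshold(right, tau)), tau)

# Made-up sample data for illustration only.
readings = [(1, "a", 0.9), (2, "b", 0.3), (3, "c", 0.8)]
locations = [(1, "x", 0.7), (2, "y", 0.95), (3, "z", 0.4)]

if __name__ == "__main__":
    tau = 0.5
    naive = ptq_naive(readings, locations, tau)
    pushed = ptq_pushed(readings, locations, tau)
    print([t[:3] for t in naive])
    print(naive == pushed)
```

In this sketch both plans return the same answer, but the pushed-down plan joins two tuples per side instead of three. With correlated tuples, attribute-level uncertainty, or duplicate-eliminating projections, this simple pruning argument no longer holds as stated, which is why dedicated rules for each operator are needed.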
