...
首页> 外文期刊>Journal of Intelligent Information Systems >Efficient pruning for top-K ranking queries on attribute-wise uncertain datasets
【24h】

Efficient pruning for top-K ranking queries on attribute-wise uncertain datasets

机译:在属性不确定的数据集上对前K个排名查询的有效修剪

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Top-K ranking queries in uncertain databases aim to find the top-K tuples according to a ranking function. The interplay between score and uncertainty makes top-K ranking in uncertain databases an intriguing issue, leading to rich query semantics. Recently, a unified ranking framework based on parameterized ranking functions (PRFs) has been formulated, which generalizes many previously proposed ranking semantics. Under the PRFs based ranking framework, efficient pruning approach for Top-K ranking on datasets with tuple-wise uncertainty has been well studied in the literature. However, this cannot be applied to top-K ranking on datasets with attribute-wise uncertainty, which are often natural and useful in analyzing uncertain data in many applications. This paper aims to develop efficient pruning techniques for top-K ranking on datasets with attribute-wise uncertainty under the PRFs based ranking framework, which has not been well studied in the literature. We first develop a Tuple Insertion Based Algorithm for computing each tuple's PRF value, which reduce the time cost from the state of the art cubic order of magnitude to quadratic order of magnitude. Based on the Tuple Insertion Based Algorithm, three pruning strategies are developed to further reduce the time cost. The mathematics of deriving the Tuple Insertion Based Algorithm and corresponding pruning strategies are also presented. At last, we show that our pruning algorithms can also be applied to the computation of the top-k aggregate queries. The experimental results on both real and synthetic data demonstrate the effectiveness and efficiency of the proposed pruning techniques.
机译:不确定数据库中的前K个排名查询旨在根据排名函数查找前K个元组。得分与不确定性之间的相互作用使不确定数据库中的前K名成为一个有趣的问题,从而导致了丰富的查询语义。最近,已经制定了基于参数化排名函数(PRF)的统一排名框架,该框架归纳了许多先前提出的排名语义。在基于PRF的排序框架下,对具有元组不确定性的数据集进行Top-K排序的有效修剪方法已经在文献中进行了深入研究。但是,这不能应用于具有属性方式不确定性的数据集的前K个排名,这在许多应用程序中分析不确定性数据时通常是自然而有用的。本文旨在开发一种有效的修剪技术,用于在基于PRF的排名框架下对具有属性方式不确定性的数据集进行top-K排名,这在文献中尚未得到很好的研究。我们首先开发了一种基于元组插入的算法来计算每个元组的PRF值,从而将时间成本从现有的立方数量级降低到二次数量级。基于基于元组插入的算法,开发了三种修剪策略以进一步减少时间成本。还介绍了推导基于元组插入的算法的数学方法和相应的修剪策略。最后,我们证明了我们的修剪算法也可以应用于前k个聚集查询的计算。真实和综合数据的实验结果证明了所提出的修剪技术的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号