...
首页> 外文期刊>Theoretical computer science >A dynamic data structure for top-k queries on uncertain data
【24h】

A dynamic data structure for top-k queries on uncertain data

机译:用于不确定数据的前k个查询的动态数据结构

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In an uncertain data set S = (S, p,f) where S is the ground set consisting of n elements, p : S → [0,1] a probability function, and f : S → R a score function, each element i ∈ S with score f (i) appears independently with probability p (i). The top-k query on S asks for the set of k elements that has the maximum probability of appearing to be the k elements with the highest scores in a random instance of S. Computing the top-k answer on a fixed S is known to be easy. In this paper, we consider the dynamic problem, that is, how to maintain the top-k query answer when S changes, including element insertions and deletions in the ground set S, changes in the probability function p and in the score function/. We present a fully dynamic data structure that handles an update in 0(fc log n) time, and answers a top-j query in O(log n +j) time for any j ≤ k. The structure has 0(n) size and can be constructed in O(n log k) time. As a building block of our dynamic structure, we present an algorithm for the all-top-k problem, that is, computing the top-j answers for all j = 1,..., k, which may be of independent interest.
机译:在不确定数据集中,S =(S,p,f),其中S是由n个元素组成的基础集,p:S→[0,1]是概率函数,而f:S→R是得分函数,每个元素分数为f(i)的i∈S独立出现,概率为p(i)。对S的前k个查询要求一组k元素的集合,该k个元素在S的随机实例中似乎是得分最高的k个元素的可能性最大。已知在固定S上计算前k个答案放轻松。在本文中,我们考虑了动态问题,即,当S发生变化时,如何维持top-k查询答案,包括在基础集S中元素的插入和删除,概率函数p和得分函数/的变化。我们提供了一个完全动态的数据结构,该结构可处理0(fc log n)时间中的更新,并在O(log n + j)时间中回答j≤k的top-j查询。该结构的大小为0(n),可以在O(n log k)的时间内构建。作为动态结构的组成部分,我们提出了一种解决全top-k问题的算法,即计算所有j = 1,...,k的top-j答案,这可能是独立考虑的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号