首页> 外文会议>IEEE International Conference on Data Mining Workshops >Reporting L Most Favorite Objects in Uncertain Databases with Probabilistic Reverse Top-k Queries
【24h】

Reporting L Most Favorite Objects in Uncertain Databases with Probabilistic Reverse Top-k Queries

机译:使用概率反向Top-k查询报告不确定数据库中的L个最喜欢的对象

获取原文

摘要

Top-k queries are widely studied for identifying a ranked set of the k most interesting objects based on the individual user preference. Reverse top-k queries are proposed from the perspective of the product manufacturer, which are essential for manufacturers to assess the potential market and impacts of their products. However, the existing approaches for reverse top-k queries are all based on the assumption that the underlying data are exact. Due to the intrinsic differences between uncertain and certain data, these methods are designed only in certain databases and cannot be applied to uncertain case directly. Motivated by this, in this paper, we firstly model the probabilistic reverse top-k queries in the context of uncertain data. Moreover, we formulate the challenging problem of processing queries that report l most favorite objects to users, where impact factor of an object is defined as the cardinality of the probabilistic reverse top-k query result set. For speeding up the query, we exploit several properties of probabilistic threshold top-k queries and probabilistic skyline queries to reduce the solution space of this problem. In addition, an upper bound of the potential users is estimated to reduce the cost of computing the probabilistic reverse top-k queries for the candidate objects. Furthermore, effective pruning heuristics are presented to further reduce the search space of query processing. Finally, efficient query algorithms are presented seamlessly with integration of the proposed pruning strategies. Extensive experiments demonstrate the efficiency and effectiveness of our proposed algorithms with various experimental settings.
机译:对前k个查询进行了广泛的研究,以基于单个用户的偏好来确定k个最有趣的对象的排名集。反向top-k查询是从产品制造商的角度提出的,这对于制造商评估潜在市场和其产品的影响至关重要。但是,反向top-k查询的现有方法都是基于基础数据准确的假设。由于不确定数据和某些数据之间的固有差异,这些方法仅在某些数据库中设计,不能直接应用于不确定情况。因此,本文首先在不确定数据的背景下,对概率反向top-k查询进行建模。此外,我们提出了一个具有挑战性的问题,即处理向用户报告最喜欢的对象的查询,其中对象的影响因子定义为概率反向top-k查询结果集的基数。为了加快查询速度,我们利用概率阈值top-k查询和概率天际线查询的一些属性来减少此问题的解决空间。另外,估计潜在用户的上限以减少计算针对候选对象的概率反向top-k查询的成本。此外,提出了有效的修剪启发法,以进一步减少查询处理的搜索空间。最后,结合所提出的修剪策略无缝地提出了有效的查询算法。大量实验证明了我们在各种实验设置下提出的算法的效率和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号