首页> 外文会议>International conference on management of data >Optimal Top-k Generation of Attribute Combinations based on Ranked Lists
【24h】

Optimal Top-k Generation of Attribute Combinations based on Ranked Lists

机译:基于排名列表的最优Top-k属性组合生成

获取原文

摘要

In this work, we study a novel top-k query type, called top-k m queries. Suppose we are given a set of groups and each group contains a set of attributes, each of which is associated with a ranked list of tuples, with ID and score. All lists are ranked in decreasing order of the scores of tuples. We are interested in finding the best combinations of attributes, each combination involving one attribute from each group. More specifically, we want the top-it combinations of attributes according to the corresponding top-m tuples with matching IDs. This problem has a wide range of applications from databases to search engines on traditional and non-traditional types of data (relational data, XML, text, etc.). We show that a straightforward extension of an optimal top-k algorithm, the Threshold Algorithm (TA), has shortcomings in solving the top-k,m problem, as it needs to compute a large number of intermediate results for each combination and reads more inputs than needed. To overcome this weakness, we provide here, for the first time, a provably instance-optimal algorithm and further develop optimizations for efficient query evaluation to reduce computational and memory costs and the number of accesses. We demonstrate experimentally the scalability and efficiency of our algorithms over three real applications.
机译:在这项工作中,我们研究了一种新颖的top-k查询类型,称为top-k m查询。假设给定了一组组,每个组包含一组属性,每个属性都与具有ID和得分的元组的已排序列表相关联。所有列表按元组分数的降序排列。我们有兴趣寻找最佳的属性组合,每种组合涉及每个组中的一个属性。更具体地说,我们希望根据具有匹配ID的相应top-m元组的属性的top-it组合。从数据库到搜索引擎,此问题在传统和非传统类型的数据(关系数据,XML,文本等)上都有广泛的应用。我们展示了最佳top-k算法(阈值算法(TA))的直接扩展在解决top-k,m问题方面存在缺陷,因为它需要为每种组合计算大量中间结果并读取更多内容。输入超出所需。为了克服这一弱点,我们在这里首次提供了可证明的实例最优算法,并进一步开发了用于有效查询评估的优化,以减少计算和内存成本以及访问次数。我们通过实验证明了我们的算法在三个实际应用中的可扩展性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号