首页> 外文会议>International conference on management of data >Optimal Top-k Generation of Attribute Combinations based on Ranked Lists
【24h】

Optimal Top-k Generation of Attribute Combinations based on Ranked Lists

机译:基于排名列表的最佳TOP-K生成属性组合

获取原文

摘要

In this work, we study a novel top-k query type, called top-k m queries. Suppose we are given a set of groups and each group contains a set of attributes, each of which is associated with a ranked list of tuples, with ID and score. All lists are ranked in decreasing order of the scores of tuples. We are interested in finding the best combinations of attributes, each combination involving one attribute from each group. More specifically, we want the top-it combinations of attributes according to the corresponding top-m tuples with matching IDs. This problem has a wide range of applications from databases to search engines on traditional and non-traditional types of data (relational data, XML, text, etc.). We show that a straightforward extension of an optimal top-k algorithm, the Threshold Algorithm (TA), has shortcomings in solving the top-k,m problem, as it needs to compute a large number of intermediate results for each combination and reads more inputs than needed. To overcome this weakness, we provide here, for the first time, a provably instance-optimal algorithm and further develop optimizations for efficient query evaluation to reduce computational and memory costs and the number of accesses. We demonstrate experimentally the scalability and efficiency of our algorithms over three real applications.
机译:在这项工作中,我们研究了一个名为top-k m查询的新型顶级k查询类型。假设我们给出了一组组,每个组都包含一组属性,每个属性与排名的元组列表相关联,具有ID和分数。所有列表都以减少元组的顺序排列。我们有兴趣找到属性的最佳组合,每个组合涉及每个组的一个属性。更具体地说,我们希望根据相应的Top-M元组与匹配ID的相应TOP-IT组合。此问题具有来自数据库的广泛应用程序,以搜索传统和非传统类型的数据类型(关系数据,XML,文本等)。我们表明,最佳Top-K算法的直接扩展,阈值算法(TA)在解决顶级K,M个问题时具有缺点,因为它需要计算每个组合的大量中间结果并读取更多输入比需要。为了克服这种弱点,我们首次提供可怕的实例 - 最佳算法,并进一步开发优化,以便有效地查询评估,以降低计算和内存成本和访问数量。我们展示了我们在三种真实应用中的算法的可扩展性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号