首页> 外文期刊>ACM transactions on database systems >Optimal Algorithms for Evaluating Rank Joins in Database Systems
【24h】

Optimal Algorithms for Evaluating Rank Joins in Database Systems

机译:数据库系统中排名联接的最佳算法

获取原文
获取原文并翻译 | 示例

摘要

In the rank join problem, we are given a set of relations and a scoring function, and the goal is to return the join results with the top K scores. It is often the case in practice that the inputs may be accessed in ranked order and the scoring function is monotonic. These conditions allow for efficient algorithms that solve the rank join problem without reading all of the input. In this article, we present a thorough analysis of such rank join algorithms. A strong point of our analysis is that it is based on a more general problem statement than previous work, making it more relevant to the execution model that is employed by database systems. One of our results indicates that the well-known HRJN algorithm has shortcomings, because it does not stop reading its input as soon as possible. We find that it is NP-hard to overcome this weakness in the general case, but cases of limited query complexity are tractable. We prove the latter with an algorithm that infers provably tight bounds on the potential benefit of reading more input in order to stop as soon as possible. As a result, the algorithm achieves a cost that is within a constant factor of optimal.
机译:在等级联接问题中,我们得到了一组关系和一个得分函数,目标是返回前K个得分的联接结果。在实践中通常是这样的情况:可以按排名顺序访问输入,并且评分函数是单调的。这些条件允许在不读取所有输入的情况下解决排序连接问题的高效算法。在本文中,我们将对这种排名联接算法进行全面分析。我们的分析的一个强项是,它基于比以前的工作更笼统的问题陈述,从而使其与数据库系统采用的执行模型更加相关。我们的结果之一表明,众所周知的HRJN算法具有缺点,因为它不会尽快停止读取其输入。我们发现在一般情况下克服这种弱点是NP难的,但是查询复杂度有限的情况很容易解决。我们用一种算法证明了后者,该算法可推断出严格的界限,该界限限制了阅读更多输入以便尽快停止的潜在好处。结果,该算法实现了在恒定的最优因子内的成本。

著录项

  • 来源
    《ACM transactions on database systems》 |2010年第1期|6.1-6.47|共47页
  • 作者单位

    Department of Computer Science, University of California at Santa Cruz, 1156 High St., Santa Cruz, CA 95064;

    Department of Computer Science, University of California at Santa Cruz, 1156 High St., Santa Cruz, CA 95064;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    algorithms; performance; theory;

    机译:算法;性能;理论;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号