首页> 外文期刊>ACM transactions on database systems >Optimizing Top-k Queries for Middleware Access: A Unified Cost-Based Approach
【24h】

Optimizing Top-k Queries for Middleware Access: A Unified Cost-Based Approach

机译:优化中间件访问的前k个查询:基于成本的统一方法

获取原文
获取原文并翻译 | 示例

摘要

This article studies optimizing top-k queries in middlewares. While many assorted algorithms have been proposed, none is generally applicable to a wide range of possible scenarios. Existing algorithms lack both the "generality" to support a wide range of access scenarios and the systematic "adaptivity" to account for runtime specifics. To fulfill this critical lacking, we aim at taking a cost-based optimization approach: By runtime search over a space of algorithms, cost-based optimization is general across a wide range of access scenarios, yet adaptive to the specific access costs at runtime. While such optimization has been taken for granted for relational queries from early on, it has been clearly lacking for ranked queries. In this article, we thus identify and address the barriers of realizing such a unified framework. As the first barrier, we need to define a "comprehensive" space encompassing all possibly optimal algorithms to search over. As the second barrier and a conflicting goal, such a space should also be "focused" enough to enable efficient search. For SQL queries that are explicitly composed of relational operators, such a space, by definition, consists of schedules of relational operators (or "query plans"). In contrast, top-k queries do not have logical tasks, such as relational operators. We thus define the logical tasks of top-k queries as building blocks to identify a comprehensive and focused space for top-k queries. We then develop efficient search schemes over such space for identifying the optimal algorithm. Our study indicates that our framework not only unifies, but also outperforms existing algorithms specifically designed for their scenarios.
机译:本文研究如何优化中间件中的前k个查询。尽管已经提出了许多各种各样的算法,但是通常没有一种算法适用于各种可能的情况。现有算法既缺乏支持广泛访问方案的“通用性”,也缺乏系统的“适应性”来考虑运行时的具体情况。为了解决这一关键的不足,我们旨在采用一种基于成本的优化方法:通过在一定算法空间上进行运行时搜索,基于成本的优化通常适用于广泛的访问场景,但可以适应运行时的特定访问成本。尽管从一开始就已将这种优化视为关系查询的理所当然,但显然对于排名查询却缺乏这种优化。因此,在本文中,我们确定并解决了实现这种统一框架的障碍。作为第一个障碍,我们需要定义一个“综合”空间,其中包含要搜索的所有可能的最佳算法。作为第二道障碍和有冲突的目标,也应充分“关注”此类空间以实现有效搜索。对于明确由关系运算符组成的SQL查询,根据定义,这样的空间由关系运算符的时间表(或“查询计划”)组成。相反,前k个查询没有逻辑任务,例如关系运算符。因此,我们将前k个查询的逻辑任务定义为构建块,以识别前k个查询的全面而集中的空间。然后,我们在这种空间上开发有效的搜索方案,以识别最佳算法。我们的研究表明,我们的框架不仅可以统一,而且还可以胜过专门针对其场景设计的现有算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号