首页> 外文期刊>Knowledge and information systems >A graph-theoretic approach to optimize keyword queries in relational databases
【24h】

A graph-theoretic approach to optimize keyword queries in relational databases

机译:图论方法在关系数据库中优化关键字查询

获取原文
获取原文并翻译 | 示例
       

摘要

Keyword search can provide users an easy method to query large and complex databases without any knowledge of structured query languages or underlying database schema. Most of the existing studies have focused on generating candidate structured queries relevant to keywords. Due to the large size of generated queries, the execution costs may be prohibitive. However, existing studies lack the idea of a generalized method to optimize the plan of the large set of generated queries. In this paper, we introduce a graph-theoretic optimization approach. We propose a general graph model, Weighted Operator Graph, to address the costs of keyword query evaluation plans. The proposed model is flexible to integrate all of the cost-based plans in a uniform way. We define a Keyword Query Optimization Problem based on a theoretical cost model as a graph-theoretic problem and show it to be a NP-hard problem. We propose a greedy heuristic Maximum Propagation that reduces the size of the intermediate result as early as possible. The proposed algorithm allows us to achieve efficiency in terms of query evaluation costs. The experimental studies on both synthetic and real data set results show that our work outperforms the existing work.
机译:关键字搜索可以为用户提供一种简单的查询大型复杂数据库的方法,而无需任何结构化查询语言或基础数据库架构的知识。现有的大多数研究都集中在生成与关键字相关的候选结构化查询。由于生成的查询的大小很大,因此执行成本可能会很高。但是,现有研究缺乏通用方法来优化生成的大量查询的计划的想法。在本文中,我们介绍了一种图论优化方法。我们提出了一种通用的图形模型,即加权算子图,以解决关键字查询评估计划的成本。提议的模型可以灵活地以统一的方式集成所有基于成本的计划。我们将基于理论成本模型的关键字查询优化问题定义为图论问题,并将其显示为NP难问题。我们提出了一种贪婪的启发式最大传播算法,该算法可以尽早减小中间结果的大小。所提出的算法使我们能够在查询评估成本方面实现效率。对综合和真实数据集结果的实验​​研究表明,我们的工作优于现有工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号