...
首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Finding Top-k Answers in Keyword Search over Relational Databases Using Tuple Units
【24h】

Finding Top-k Answers in Keyword Search over Relational Databases Using Tuple Units

机译:使用元组单元在关系数据库的关键字搜索中查找前k个答案

获取原文
获取原文并翻译 | 示例
           

摘要

Existing studies on keyword search over relational databases usually find Steiner trees composed of connected database tuples as answers. They on-the-fly identify Steiner trees by discovering rich structural relationships between database tuples, and neglect the fact that such structural relationships can be precomputed and indexed. Recently, tuple units are proposed to improve search efficiency by indexing structural relationships, and existing methods identify a single tuple unit to answer keyword queries. However, in many cases, multiple tuple units should be integrated to answer a keyword query. Thus, these methods will involve false negatives. To address this problem, in this paper, we study how to integrate multiple related tuple units to effectively answer keyword queries. To achieve a high performance, we devise two novel indexes, single-keyword-based structure-aware index and keyword-pair-based structure-aware index, and incorporate structural relationships between different tuple units into the indexes. We use the indexes to efficiently identify the answers of integrated tuple units. We develop new ranking techniques and algorithms to progressively find the top-k answers. We have implemented our method in real database systems, and the experimental results show that our approach achieves high search efficiency and result quality, and outperforms state-of-the-art methods significantly.
机译:有关关系数据库中关键字搜索的现有研究通常会找到由连接的数据库元组组成的Steiner树作为答案。他们通过发现数据库元组之间的丰富结构关系来即时识别Steiner树,而忽略了这样的结构关系可以预先计算和索引的事实。最近,提出了元组单元以通过索引结构关系来提高搜索效率,并且现有方法识别单个元组单元来回答关键字查询。但是,在许多情况下,应集成多个元组单元以回答关键字查询。因此,这些方法将涉及假阴性。为了解决这个问题,在本文中,我们研究了如何集成多个相关的元组单元以有效地回答关键字查询。为了实现高性能,我们设计了两个新颖的索引,即基于单关键字的结构感知索引和基于关键字对的结构感知索引,并将不同元组单元之间的结构关系合并到索引中。我们使用索引来有效地识别集成元组单元的答案。我们开发了新的排名技术和算法,以逐步找到前k个答案。我们已经在真实的数据库系统中实现了我们的方法,实验结果表明我们的方法实现了高搜索效率和结果质量,并且明显优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号