Conference: SIGMOD/PODS

SPARK: Top-k Keyword Query in Relational Databases


Abstract

With the increasing amount of text data stored in relational databases, there is a demand for RDBMSs to support keyword queries over text data. As a search result is often assembled from multiple relational tables, traditional IR-style ranking and query evaluation methods cannot be applied directly. In this paper, we study the effectiveness and efficiency issues of answering top-k keyword queries in relational database systems. We propose a new ranking formula by adapting existing IR techniques based on a natural notion of a virtual document. Compared with previous approaches, our new ranking method is simple yet effective, and agrees with human perceptions. We also study efficient query processing methods for the new ranking method, and propose algorithms that have minimal accesses to the database. We have conducted extensive experiments on large-scale real databases using two popular RDBMSs. The experimental results demonstrate significant improvement over the alternative approaches in terms of retrieval effectiveness and efficiency.
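To make the virtual-document idea concrete, the sketch below treats the text of all tuples joined into one search result as a single document, scores that document against the query, and keeps the k best results. It is only an illustration under stated assumptions: the scoring function is a generic TF-IDF-style formula with a simple length normalisation, not the ranking formula proposed in the paper, and the names `virtual_document`, `score`, and `top_k` are hypothetical. It also scores every candidate exhaustively, whereas the paper is concerned with algorithms that minimise database accesses.

```python
import heapq
import math
from collections import Counter


def virtual_document(joined_tuples):
    """Concatenate the text of all tuples in one joined result into a token list."""
    tokens = []
    for text in joined_tuples:
        tokens.extend(text.lower().split())
    return tokens


def score(tokens, query, doc_freq, n_docs):
    """Generic TF-IDF-style score of a virtual document (assumed, not the paper's formula)."""
    tf = Counter(tokens)
    total = 0.0
    for term in query:
        if tf[term] == 0:
            continue
        # Smoothed inverse document frequency over the candidate results.
        idf = math.log((n_docs + 1) / (doc_freq.get(term, 0) + 1))
        # Dampened term frequency, counted over the whole joined result.
        total += (1 + math.log(tf[term])) * idf
    # Simple length normalisation so that larger joined results are not favoured.
    return total / (1 + math.log(1 + len(tokens)))


def top_k(results, query, k):
    """Return the k highest-scoring joined results as (score, index) pairs."""
    docs = [virtual_document(r) for r in results]
    doc_freq = Counter(term for d in docs for term in set(d))
    scored = [(score(d, query, doc_freq, len(docs)), i) for i, d in enumerate(docs)]
    return heapq.nlargest(k, scored)


# Each candidate result is a list of text fields from the tuples joined together.
candidates = [
    ["SPARK keyword search", "relational databases"],
    ["query optimization"],
    ["keyword keyword search in databases", "keyword query"],
]
best = top_k(candidates, ["keyword", "query"], k=2)
```

Because term frequencies are counted over the whole joined result rather than per tuple, a keyword spread across several joined tuples contributes to one score, which is the intuition behind scoring a virtual document.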
