首页> 外文期刊>World Wide Web >G-Index Model: A generic model of index schemes for top-k spatial-keyword queries
【24h】

G-Index Model: A generic model of index schemes for top-k spatial-keyword queries

机译:G索引模型:用于前k个空间关键字查询的索引方案的通用模型

获取原文
获取原文并翻译 | 示例
           

摘要

A top-k spatial-keyword query returns the k best spatio-textual objects ranked based on their proximity to the query location and relevance to the query keywords. Various index schemes have been proposed for top-k spatial-keyword queries; however, a unified framework covering all these schemes has not been proposed. In this paper, we present a generic model of index schemes for top-k spatial-keyword queries, which we call G-Index Model. First, G-Index Model is a unified framework that exhaustively investigates all the possible index schemes for top-k spatial-keyword queries. For this, we conjecture that data clustering is the key element in composing various index schemes and generate index schemes as combinations of clustering. The result shows that all the existing methods map to those generated by G-Index Model. Using G-Index Model, we also discover two new methods that have not been reported before. Second, we show that G-Index Model is generic, i.e., it can generate index schemes for a class of queries integrating arbitrary multiple data types. For this, we show that G-Index Model can enumerate index schemes for two classes of queries: the spatial-keyword query (without the top-k constraint) and the top-k spatial-keyword-relational query, which adds the relational data type to the top-k spatial-keyword query. Third, we propose a cost model of the generated methods for the top-k spatial-keyword query. Consequently, the cost model allows us to do physical database design so as to find an optimal index scheme for a given usage pattern (i.e., a set of query loads and frequencies). We validate the cost model through extensive experiments.
机译:前k个空间关键字查询返回基于与查询位置的接近程度和与查询关键字的相关性排序的k个最佳时空文本对象。对于前k个空间关键字查询,已经提出了各种索引方案。但是,尚未提出涵盖所有这些方案的统一框架。在本文中,我们介绍了用于前k个空间关键字查询的索引方案的通用模型,称为G-Index模型。首先,G索引模型是一个统一的框架,它对前k个空间关键字查询的所有可能的索引方案进行了详尽的研究。为此,我们推测数据聚类是组成各种索引方案的关键元素,并生成索引方案作为聚类的组合。结果表明,所有现有方法都映射到G-Index模型生成的方法。使用G索引模型,我们还发现了两个以前尚未报告的新方法。其次,我们证明G索引模型是通用的,即它可以为集成任意多个数据类型的一类查询生成索引方案。为此,我们表明G-Index Model可以枚举两类查询的索引方案:空间关键字查询(无top-k约束)和top-k空间关键字关系查询,该查询添加了相关数据输入前k个空间关键字查询。第三,我们提出了用于前k位空间关键字查询的生成方法的成本模型。因此,成本模型使我们能够进行物理数据库设计,以便为给定的使用模式(即一组查询负载和频率)找到最佳的索引方案。我们通过大量实验验证了费用模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号