首页> 外文期刊>Information Sciences: An International Journal >The partitioned-layer index: Answering monotone top-k queries using the convex skyline and partitioning-merging technique
【24h】

The partitioned-layer index: Answering monotone top-k queries using the convex skyline and partitioning-merging technique

机译:分区层索引:使用凸天际线和分区合并技术回答单调top-k查询

获取原文
获取原文并翻译 | 示例
       

摘要

A top-k query returns k tuples with the highest (or the lowest) scores from a relation. The score is computed by combining the values of one or more attributes. We focus on top-k queries having monotone linear score functions. Layer-based methods are well-known techniques for top-k query processing. These methods construct a database as a single list of layers. Here, the ith layer has the tuples that can be the top-i tuple. Thus, these methods answer top-k queries by reading at most k layers. Query performance, however, is poor when the number of tuples in each layer (simply, the layer size) is large. In this paper, we propose a new layer-ordering method, called the Partitioned-Layer Index (simply, the PL Index), that significantly improves query performance by reducing the layer size. The PL Index uses the notion of partitioning, which constructs a database as multiple sublayer lists instead of a single layer list subsequently reducing the layer size. The PL Index also uses the convex skyline, which is a subset of the skyline, to construct a sublayer to further reduce the layer size. The PL Index has the following desired properties. The query performance of the PL Index is quite insensitive to the weights of attributes (called the preference vector) of the score function and is approximately linear in the value of k. The PL Index is capable of tuning query performance for the most frequently used value of k by controlling the number of sublayer lists. Experimental results using synthetic and real data sets show that the query performance of the PL Index significantly outperforms existing methods except for small values of k (say, k 6 9).
机译:前k个查询从关系中返回得分最高(或最低)的k个元组。通过组合一个或多个属性的值来计算分数。我们专注于具有单调线性得分函数的前k个查询。基于层的方法是用于top-k查询处理的众所周知的技术。这些方法将数据库构造为单个层列表。在这里,第i层具有可以成为top-i元组的元组。因此,这些方法通过最多读取k层来回答前k个查询。但是,当每层中的元组数(仅是层大小)较大时,查询性能就会很差。在本文中,我们提出了一种新的层排序方法,称为分区层索引(简称PL索引),该方法通过减小层大小显着提高了查询性能。 PL索引使用分区的概念,该概念将数据库构造为多个子层列表,而不是单个层列表,从而减小了层大小。 PL索引还使用凸出的天际线(它是天际线的一个子集)来构造子层,以进一步减小层的大小。 PL索引具有以下所需属性。 PL索引的查询性能对得分函数的属性权重(称为偏好向量)非常不敏感,并且k值近似线性。通过控制子层列表的数量,PL索引能够针对最常用的k值调整查询性能。使用合成和真实数据集的实验结果表明,PL索引的查询性能明显优于现有方法,但k值较小(例如k 6 9)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号