首页> 外文会议>ACM international conference on information and knowledge management >Being Picky-Processing Top-K Queries with Set-Defined Selections
【24h】

Being Picky-Processing Top-K Queries with Set-Defined Selections

机译:通过集合定义的选择进行挑剔的前K个查询

获取原文

摘要

Focusing on the top-K items according to a ranking criterion constitutes an important functionality in many different query answering scenarios. The idea is to read only the necessary information-mostly from secondary storage-with the ultimate goal to achieve low latency. In this work, we consider processing such top-K queries under the constraint that the result items are members of a specific set, which is provided at query time. We call this restriction a set-defined selection criterion. Set-defined selections drastically influence the pros and cons of an id-ordered index vs. a score-ordered index. We present a mathematical model that allows to decide at runtime which index to choose, leading to a combined index. To improve the latency around the break even point of the two indices, we show how to benefit from a partitioned score-ordered index and present an algorithm to create such partitions based on analyzing query logs. Further performance gains can be enjoyed using approximate top-K results, with tunable result quality. The presented approaches are evaluated using both real-world and synthetic data.
机译:在许多不同的查询回答方案中,根据排名标准将重点放在前K个项目上,是一项重要的功能。这个想法是只读取必要的信息,主要是从辅助存储中读取信息,其最终目的是实现低延迟。在这项工作中,我们考虑在结果项是特定集合的成员的约束下处理此类前K个查询,这是在查询时提供的。我们将此限制称为集合定义的选择标准。集合定义的选择会极大地影响id排序索引和分数排序索引的优缺点。我们提供了一个数学模型,该模型允许在运行时决定选择哪个索引,从而生成一个组合索引。为了改善围绕两个索引的收支平衡点的延迟,我们展示了如何从分区的得分排序索引中受益,并提出了一种基于分析查询日志来创建此类分区的算法。使用近似top-K的结果可以得到进一步的性能提升,并且结果质量可调。使用实际数据和综合数据对所提出的方法进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号