Being Picky-Processing Top-K Queries with Set-Defined Selections

机译：通过集合定义的选择进行挑剔的前K个查询

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Focusing on the top-K items according to a ranking criterion constitutes an important functionality in many different query answering scenarios. The idea is to read only the necessary information-mostly from secondary storage-with the ultimate goal to achieve low latency. In this work, we consider processing such top-K queries under the constraint that the result items are members of a specific set, which is provided at query time. We call this restriction a set-defined selection criterion. Set-defined selections drastically influence the pros and cons of an id-ordered index vs. a score-ordered index. We present a mathematical model that allows to decide at runtime which index to choose, leading to a combined index. To improve the latency around the break even point of the two indices, we show how to benefit from a partitioned score-ordered index and present an algorithm to create such partitions based on analyzing query logs. Further performance gains can be enjoyed using approximate top-K results, with tunable result quality. The presented approaches are evaluated using both real-world and synthetic data.

机译：在许多不同的查询回答方案中，根据排名标准将重点放在前K个项目上，是一项重要的功能。这个想法是只读取必要的信息，主要是从辅助存储中读取信息，其最终目的是实现低延迟。在这项工作中，我们考虑在结果项是特定集合的成员的约束下处理此类前K个查询，这是在查询时提供的。我们将此限制称为集合定义的选择标准。集合定义的选择会极大地影响id排序索引和分数排序索引的优缺点。我们提供了一个数学模型，该模型允许在运行时决定选择哪个索引，从而生成一个组合索引。为了改善围绕两个索引的收支平衡点的延迟，我们展示了如何从分区的得分排序索引中受益，并提出了一种基于分析查询日志来创建此类分区的算法。使用近似top-K的结果可以得到进一步的性能提升，并且结果质量可调。使用实际数据和综合数据对所提出的方法进行了评估。

著录项

来源
《ACM international conference on information and knowledge management》|2012年|912-921|共10页
会议地点
作者
Aleksandar Stupar; Sebastian Michel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
top-K query processing; index partitioning;

机译：top-K查询处理;索引分区;

相似文献

外文文献
中文文献
专利

1. Asymptotically Optimal Encodings of Range Data Structures for Selection and Top-k Queries [J] . Grossi Roberto, Iacono John, Navarro Gonzalo, ACM transactions on algorithms . 2017,第2期

机译：选择和TOP-K查询的范围数据结构的渐近最佳编码
2. Analysis and evaluation of the top-k most influential location selection query [J] . Chen Jian, Huang Jin, Wen Zeyi, Knowledge and information systems . 2015,第1期

机译：前k个最有影响力的位置选择查询的分析和评估
3. A Regression Dependent Iterative Algorithm for Optimizing Top-K Selection in Simulation Query Language [J] . Susan Farley, Alexander Brodsky, Chun-Hung Chen International journal of decision support system technology . 2012,第3期

机译：模拟查询语言中优化Top-K选择的回归相关迭代算法
4. Being Picky-Processing Top-K Queries with Set-Defined Selections [C] . Aleksandar Stupar, Sebastian Michel ACM international conference on information and knowledge management . 2012

机译：挑剔处理的顶部K具有定义选择的查询
5. Top-K Query Processing in Edge-Labeled Graph Data. [D] . Park, Noseong. 2016

机译：边缘标签图形数据中的Top-K查询处理。
6. Geo-Social Top-k and Skyline Keyword Queries on Road Networks [O] . Muhammad Attique, Muhammad Afzal, Farman Ali, 2020

机译：道路网络上的Geo-Social Top-k和Skyline关键字查询
7. Efficient all top-k computation - a unified solution for all top-k, reverse top-k and top-m influential queries [O] . Shen Ge, Leong Hou U, Nikos Mamoulis, 2015

机译：高效的所有top-k计算 - 针对所有top-k，反向top-k和top-m有影响力查询的统一解决方案

Being Picky-Processing Top-K Queries with Set-Defined Selections

摘要

著录项

相似文献

相关主题

期刊订阅