Pareto-Based Dominant Graph: An Efficient Indexing Structure to Answer Top-K Queries

Zou Lei; Chen Lei


Abstract

Given a record set D and a query score function F, a top-k query returns the k records from D whose values of F on their attributes are the highest. In this paper, we investigate the intrinsic connection between top-k queries and the dominant relationships between records and, based on this connection, propose an efficient layer-based indexing structure, the Pareto-Based Dominant Graph (DG), to improve query efficiency. Specifically, DG is built offline to express the dominant relationships between records, and a top-k query is implemented as a graph traversal problem, i.e., the Traveler algorithm. We prove theoretically that the size of the search space (that is, the number of records retrieved from the record set to answer a top-k query) in our algorithm is directly related to the cardinality of the skyline points in the record set (see Theorem 3). To reduce the I/O cost of the Traveler algorithm, we propose a cluster-based storage schema. We also propose cost estimation methods. Based on the cost analysis, we propose an optimization technique, the pseudorecord, to further improve search efficiency. To handle top-k queries over high-dimensional record sets, we also propose the N-Way Traveler algorithm. To handle DG maintenance efficiently, we propose "Insertion" and "Deletion" algorithms for DG. Finally, extensive experiments demonstrate that our proposed methods significantly outperform their counterparts, including both classical and state-of-the-art top-k algorithms.
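
The idea of answering a top-k query by traversing a dominance graph can be illustrated with a small Python sketch. This is not the paper's Traveler algorithm or its layered DG: it is a simplified illustration that assumes a linear score function with non-negative weights (so that a record never scores lower than a record it dominates) and stores every dominance edge rather than only edges between adjacent layers. The names `dominates`, `build_dominance_graph`, and `top_k` are hypothetical.

```python
import heapq


def dominates(a, b):
    """True if a is >= b on every attribute and strictly greater on at least one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def build_dominance_graph(records):
    """Offline step (quadratic, for illustration only).

    children[i] lists the records dominated by record i;
    n_dominators[i] counts the records that dominate record i.
    """
    n = len(records)
    children = [[] for _ in range(n)]
    n_dominators = [0] * n
    for i in range(n):
        for j in range(n):
            if i != j and dominates(records[i], records[j]):
                children[i].append(j)
                n_dominators[j] += 1
    return children, n_dominators


def top_k(records, children, n_dominators, weights, k):
    """Best-first traversal: a record is scored only after all its dominators are output.

    Assumes non-negative weights, so the score function is monotone.
    Returns the indices of k highest-scoring records, best first.
    """
    def score(r):
        return sum(w * x for w, x in zip(weights, r))

    remaining = list(n_dominators)
    # Start from the undominated (skyline) records; negate scores for a max-heap.
    heap = [(-score(records[i]), i) for i in range(len(records)) if remaining[i] == 0]
    heapq.heapify(heap)
    result = []
    while heap and len(result) < k:
        _, i = heapq.heappop(heap)
        result.append(i)
        for j in children[i]:
            remaining[j] -= 1
            if remaining[j] == 0:  # every dominator of j has been output
                heapq.heappush(heap, (-score(records[j]), j))
    return result


if __name__ == "__main__":
    records = [(9, 1), (7, 7), (1, 9), (6, 6), (2, 3), (1, 1)]
    children, n_dominators = build_dominance_graph(records)
    print(top_k(records, children, n_dominators, (0.5, 0.5), 2))
```

In this sketch only the skyline records and the records whose dominators have all been returned are ever scored, which gives an intuition for why the search space depends on the number of skyline points.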

Bibliographic details

Source: Knowledge and Data Engineering, IEEE Transactions on, 2011, Issue 5, pp. 727-741 (15 pages)
Authors: Zou Lei; Chen Lei
Affiliation: Peking University, Beijing
Original format: PDF
Language: English
Keywords: top-k query; algorithms; database

Similar literature

1. GStar: an efficient framework for answering top-k star queries on billion-node knowledge graphs [J]. Jin Jiahui, Luo Junzhou, Khemmarat Samamon. World Wide Web, 2019, Issue 4.
2. Answering top-K query combined keywords and structural queries on RDF graphs [J]. Peng Peng, Zou Lei, Qin Zheng. Information Systems, 2017, Issue JULa.
3. Dominant Graph: An Efficient Indexing Structure to Answer Top-K Queries [C]. Lei Zou, Lei Chen. International Conference on Data Engineering, 2008.
4. Locality Sensitive Indexing for Efficient High-Dimensional Query Answering in the Presence of Excluded Regions [D]. Bhat, Aneesha. 2016.
5. Efficient and Exact Local Search for Random Walk Based Top-K Proximity Query in Large Graphs [O]. Yubao Wu, Ruoming Jin, Xiang Zhang.
6. Pareto-Based Dominant Graph: An Efficient Indexing Structure to Answer Top-K Queries [O]. Lei Zou, Lei Chen. 2013.
