Efficient Top-k Retrieval on Massive Data

Han Xixian; Li Jianzhong; Gao Hong

首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Efficient Top-k Retrieval on Massive Data

【24h】

Efficient Top-k Retrieval on Massive Data

机译：对海量数据的高效Top-k检索

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In many applications, top- query is an important operation to return a set of interesting points in a potentially huge data space. It is analyzed in this paper that the existing algorithms cannot process top- query on massive data efficiently. This paper proposes a novel table-scan-based T2S algorithm to efficiently compute top- results on massive data. T2S first constructs the presorted table, whose tuples are arranged in the order of the round-robin retrieval on the sorted lists. T2S maintains only fixed number of tuples to compute results. The early termination checking for T2S is presented in this paper, along with the analysis of scan depth. The selective retrieval is devised to skip the tuples in the presorted table which are not top- results. The theoretical analysis proves that selective retrieval can reduce the number of the retrieved tuples significantly. The construction and incremental-update/batch-processing methods for the used structures are proposed in this paper. The extensive experimental results, conducted on synthetic and real-life data sets, show that T2S has a significant advantage over the existing algorithms.

机译：在许多应用程序中，顶部查询是一项重要操作，可以在潜在的巨大数据空间中返回一组有趣的点。本文分析了现有算法不能有效地处理海量数据的顶级查询。本文提出了一种新颖的基于表格扫描的T2S算法，可以有效地计算海量数据上的最佳结果。 T2S首先构造预排序表，其元组在循环排序列表上按循环检索的顺序排列。 T2S仅维护固定数量的元组来计算结果。本文介绍了T2S的早期终止检查，以及对扫描深度的分析。设计选择性检索以跳过预排序表中不是最佳结果的元组。理论分析证明，选择性检索可以显着减少元组的检索数量。本文提出了所用结构的构造方法和增量更新/批处理方法。在合成和现实数据集上进行的广泛实验结果表明，T2S相对于现有算法具有明显优势。

著录项

来源
《Knowledge and Data Engineering, IEEE Transactions on》 |2015年第10期|2687-2699|共13页
作者
Han Xixian; Li Jianzhong; Gao Hong;
展开▼
作者单位

School of Computer Science and Technology, Harbin Institute of Technology, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Early termination; Massive data; Selective retrieval; T2S algorithm; Table scan; early termination; selective retrieval; table scan;

机译：提前终止;海量数据;选择性检索;T2S算法;表扫描;提前终止;选择性检索;表扫描;

相似文献

外文文献
中文文献
专利

1. Ranking the big sky: efficient top-k skyline computation on massive data [J] . Han Xixian, Wang Bailing, Li Jianzhong, Knowledge and information systems . 2019,第1期

机译：排名大天空：在大规模数据上有效的Top-K天际线计算
2. Efficient Top-k Dominating Computation on Massive Data [J] . Xixian Han, Jianzhong Li, Hong Gao IEEE Transactions on Knowledge and Data Engineering . 2017,第6期

机译：基于海量数据的高效Top-k支配计算
3. TKAP: Efficiently processing top-k query on massive data by adaptive pruning [J] . Han Xixian, Liu Xianmin, Li Jianzhong, Knowledge and information systems . 2016,第2期

机译：TKAP：通过自适应修剪有效处理海量数据的top-k查询
4. Efficient top-k retrieval on massive data [C] . Xixian Han, Jianzhong Li, Hong Gao IEEE International Conference on Data Engineering . 2016

机译：对海量数据的高效top-k检索
5. Top-k with diversity-m data retrieval in Wireless Sensor Networks [D] . Puram, Kiran Kumar 2014

机译：无线传感器网络中具有多样性-m数据检索的Top-k
6. Efficient Retrieval of Massive Ocean Remote Sensing Images via a Cloud-Based Mean-Shift Algorithm [O] . Mengzhao Yang, Wei Song, Haibin Mei 2017

机译：通过基于云的均值漂移算法高效检索海量海洋遥感图像
7. Efficient Top-K Retrieval with Signatures [O] . Shlomo Geva, Anthony Nguyen, Guido Zuccon 2016

机译：使用签名进行高效的Top-K检索
8. Dynamic Database. Efficiently Convert Massive Quantities of Sensor Data into Actionable Information for Tactical Commanders. [R] . Kessler, O. 2000

机译：动态数据库。有效地将大量传感器数据转换为战术指挥官的可操作信息。

Efficient Top-k Retrieval on Massive Data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅