首页> 外文学位 >Algorithms For Building Compact Representatives And Processing Ranking Queries

【24h】

Algorithms For Building Compact Representatives And Processing Ranking Queries

机译：构建紧凑型代表和处理排名查询的算法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Ranked retrieval model has rapidly replaced the traditional Boolean retrieval model as the de facto way for query processing when a large portion of (big) data matches a given query. Returning all the query results in these cases is not efficient nor informative. Unlike the Boolean retrieval model, the ranked retrieval model orders the matching tuples according to an often proprietary ranking function and returns the top-k of them. In this dissertation, we study ranked retrieval model and propose exact and approximate algorithms for (i) building representatives for fast query processing, and (ii) online processing of ranking queries. We study the problem both in the general cases and in the special environment of web databases, a natural fit for the ranked retrieval model.;We start the dissertation by building representatives that serve as indices for ranking query processing. A critical observation is that skyline, also known as Pareto-optimal, (resp. k sky-band) is a set that contains the top-1 (resp. top-k) for every possible ranking function following the monotonic order of attribute values. Thus, first, we study the problem crowdsourcing Pareto-optimal object finding, in the case where objects do not have explicit attributes and preference relations on objects are strict partial orders. Then, we initiate the research into the novel problem of skyline discovery over hidden web databases, which enables a wide variety of innovative third-party applications over one or multiple web databases.;A major problem with the ranking queries representatives, i.e., skyline and convex hull, is that as in real-world applications the representative can be a significant portion of the data, its performance in the ranking query processing is greatly reduced. Thus, computing a subset limited to r tuples that minimize the user's dissatisfaction with the result from the limited set is of interest. We make several fundamental theoretical as well as practical advances in developing such a compact set.;Finally, considering the limitations of top-k indices, while focusing on the client-server databases, we propose query reranking third-party service that uses public interface of the database to enable the on-the-fly processing of ranking queries.

机译：当大部分（大）数据与给定查询匹配时，排名检索模型已迅速取代了传统的布尔检索模型，成为查询处理的实际方法。在这种情况下，返回所有查询结果既没有效率，也没有提供信息。与布尔检索模型不同，分级检索模型根据通常专有的分级函数对匹配的元组进行排序，并返回它们的前k个。在本文中，我们研究了排名检索模型，并提出了精确和近似的算法，用于（i）建立代表以进行快速查询处理，以及（ii）在线处理排名查询。我们在一般情况下和在Web数据库的特殊环境下都研究了该问题，这很自然地适合于排名检索模型。我们通过建立代表作为排名查询处理指标的代表来开始本文。一个关键的观察是，天际线（也称为帕累托最优）（分别为k个天空带）是一个集合，其中包含每个遵循属性值单调顺序的排名函数的top-1（resp.top-k）。。因此，首先，在对象没有显式属性且对象上的偏好关系是严格偏序的情况下，我们研究了众包帕累托最优对象发现的问题。然后，我们开始研究在隐藏的Web数据库上发现天际线的新问题，这使一个或多个Web数据库上的各种创新的第三方应用程序成为可能。排名查询代表的一个主要问题，即skyline和凸包是指，由于在现实世界中的应用程序中代表可以是数据的很大一部分，因此它在排名查询处理中的性能大大降低了。因此，感兴趣的是计算限制于元组的子集，以使用户对有限集合的结果的不满最小化。在开发这种紧凑集方面，我们取得了一些理论上和实践上的进步。最后，考虑到top-k索引的局限性，同时着眼于客户端-服务器数据库，我们建议使用公共接口的查询重新排名第三方服务数据库以实现对排名查询的即时处理。

著录项

作者
Asudeh Naee, Abolfazl.;
展开▼
作者单位

The University of Texas at Arlington.;

展开▼
授予单位 The University of Texas at Arlington.;
学科 Computer science.
学位 Ph.D.
年度 2017
页码 195 p.
总页数 195
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A query language and ranking algorithm for news items in the Hermes news processing framework [J] . Frederik Hogenboom, Damir Vandic, Flavius Frasincar, Science of Computer Programming . 2014,第pta1期

机译：Hermes新闻处理框架中新闻项的查询语言和排名算法
2. Ranking Documents Based on the Semantic Relations Using Analytical Hierarchy Process: Query Expansion and Ranking Process [J] . Ali I. El-Dsouky, Hesham A. Ali, Rabab Samy Rashed International journal of information retrieval research . 2017,第3期

机译：使用层次分析法基于语义关系对文档进行排名：查询扩展和排名过程
3. Fast T-overlap query algorithms using graphics processor units and its applications in web data query [J] . Li Mengjuan, Jia Lianyin, You Jinguo, World Wide Web . 2015,第2期

机译：使用图形处理器单元的快速T重叠查询算法及其在Web数据查询中的应用
4. Performance analysis of combination of fuzzy analytic hierarchy process(FAHP)algorithms with preference ranking organization method for enrichment evaluation algorithm(PROMETHEE II)in the ranking process to determine the increase in employee class [C] . M.Irfan Ramadhan, Muhammad Zarlis, Benny Benyamin Nasution Nommensen International Conference on Technology and Engineering . 2021

机译：模糊分析层次处理（FAHP）算法与偏好排名组织方法的绩效分析评估算法（Promethee II）在排名过程中确定员工课程增加
5. Federated query processing using ontology structure and ranking in a service oriented environment. [D] . Alipanah, Neda. 2012

机译：在面向服务的环境中使用本体结构和排名进行联合查询处理。
6. IJA: An Efficient Algorithm for Query Processing in Sensor Networks [O] . Hyun Chang Lee, Young Jae Lee, Ji Hyang Lim, 2011

机译：IJA：传感器网络中查询处理的高效算法
7. Ranking Query Answers in Probabilistic Databases: Complexity and Efficient Algorithms [O] . Dan Olteanu, Hongkai Wen 2012

机译：在概率数据库中对查询答案进行排名：复杂性和高效算法

Algorithms For Building Compact Representatives And Processing Ranking Queries

摘要

著录项

相似文献

相关主题

期刊订阅