首页> 外文OA文献 >Efficient processing of spatial queries over uncertain database
【2h】

Efficient processing of spatial queries over uncertain database

机译:不确定数据库上空间查询的高效处理

摘要

Uncertainty is inherent in many important applications, and many important queries are re-investigated in the context of uncertain data models. Efficient algorithms are strongly demanded to analyze spatial uncertain data.This thesis studies four fundamental problems to analyze spatial uncertain data by proposing efficient query processing algorithms, including (1) find top k influential facilities, (2) identify top k dominating objects, (3) range search on uncertain trajectories, and (4) top k similarity join.Firstly, we study the problem of finding top k most influential facilities over uncertain objects. We propose a new ranking model to identify the top k most influential facilities, which captures influence of facilities on the uncertain objects. Effective and efficient algorithms are proposed following the filtering-verification paradigm by utilizing two uncertain object indexing techniques. To effectively support uncertain objects with a large number of instances, we further develop randomized algorithms with accuracy guarantee.Secondly, we study the problem of top k dominating query on uncertain data, which is an essential method in the multi-criteria decision analysis when an explicit scoring function is not available. We formally introduce the top k dominating model, and propose effective and efficient algorithms to identify the top k dominating objects. Novel pruning techniques are proposed by utilizing the spatial indexing and statistic information to reduce CPU and I/O costs.Thirdly, we investigate the problem of range search on uncertain trajectories by assuming uncertain trajectories are modeled by the Markov Chains. We propose a general framework for range search on uncertain trajectories following the filtering-refinement paradigm where summaries of uncertain trajectories are constructed to facilitate the filtering process. Statistics based and partition based filtering techniques are developed to enhance the filtering capabilities.Finally, we investigate the problem of top k similarity join over multi-valued objects. We apply two types of quantile based distance measures to explore the relative instance distribution among the multiple instances of objects. Following a filtering-refinement framework, efficient and effective techniques to process top k similarity joins over multi-valued objects are developed. Novel distance, statistic and weight based pruning techniques are proposed to speed up the computations.
机译:在许多重要的应用程序中,不确定性是固有的,并且在不确定的数据模型的背景下,对许多重要的查询进行了重新调查。迫切需要一种有效的算法来分析空间不确定性数据。本文通过提出有效的查询处理算法来研究分析空间不确定性数据的四个基本问题,包括(1)找到前k个有影响力的设施,(2)确定前k个主要控制对象,(3 )对不确定轨迹的范围搜索,以及(4)top k相似性联接。首先,我们研究了在不确定对象上找到top k最有影响力的设施的问题。我们提出了一种新的排名模型来确定前k个最具影响力的设施,该模型可以捕获设施对不确定对象的影响。通过利用两种不确定的对象索引技术,提出了一种有效且高效的算法,遵循了过滤验证范式。为了有效地支持大量实例的不确定对象,我们进一步开发了具有精度保证的随机算法。其次,研究了不确定性数据的前k个支配查询问题,这是多准则决策分析中不可或缺的一种方法。显式评分功能不可用。我们正式介绍了前k个主要控制模型,并提出了有效的算法来识别前k个主要控制对象。通过利用空间索引和统计信息来减少CPU和I / O成本,提出了新颖的修剪技术。第三,我们假设不确定轨迹由Markov链建模,从而研究了不确定轨迹的范围搜索问题。我们提出了一个通用的框架,用于遵循滤波细化范式对不确定轨迹进行范围搜索,在此结构中,不确定轨迹的摘要被构建以促进滤波过程。最终,我们研究了基于统计和基于分区的过滤技术,以增强过滤能力。最后,我们研究了多值对象的前k个相似性连接问题。我们应用两种基于分位数的距离度量来探索对象的多个实例之间的相对实例分布。遵循筛选优化框架,开发了有效且有效的技术来处理多值对象上的前k个相似性联接。提出了基于距离,统计和权重的新型修剪技术,以加快计算速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号