首页> 外文学位 >Efficient Processing of Skyline Queries on Static Data Sources, Data Streams and Incomplete Datasets.
【24h】

Efficient Processing of Skyline Queries on Static Data Sources, Data Streams and Incomplete Datasets.

机译:有效处理静态数据源,数据流和不完整数据集上的天际线查询。

获取原文
获取原文并翻译 | 示例

摘要

Skyline queries extract interesting points that are non-dominated and help paint the bigger picture of the data in question. They are valuable in many multi-criteria decision applications and are becoming a staple of decision support systems.;An assumption commonly made by many skyline algorithms is that a skyline query is applied to a single static data source or data stream. Unfortunately, this assumption does not hold in many applications in which a skyline query may involve attributes belonging to multiple data sources and requires a join operation to be performed before the skyline can be produced. Recently, various skyline-join algorithms have been proposed to address this problem in the context of static data sources. However, these algorithms suffer from several drawbacks: they often need to scan the data sources exhaustively to obtain the skyline-join results; moreover, the pruning techniques employed to eliminate tuples are largely based on expensive tuple-to-tuple comparisons. On the other hand, most data stream techniques focus on single stream skyline queries, thus rendering them unsuitable for skyline-join queries.;Another assumption typically made by most of the earlier skyline algorithms is that the data is complete and all skyline attribute values are available. Due to this constraint, these algorithms cannot be applied to incomplete data sources in which some of the attribute values are missing and are represented by NULL values. There exists a definition of dominance for incomplete data, but this leads to undesirable consequences such as non-transitive and cyclic dominance relations both of which are detrimental to skyline processing.;Based on the aforementioned observations, the main goal of the research described in this dissertation is the design and development of a framework of skyline operators that effectively handles three distinct types of skyline queries: 1) skyline-join queries on static data sources, 2) skyline-window-join queries over data streams, and 3) strata-skyline queries on incomplete datasets. This dissertation presents the unique challenges posed by these skyline queries and addresses the shortcomings of current skyline techniques by proposing efficient methods to tackle the added overhead in processing skyline queries on static data sources, data streams, and incomplete datasets.
机译:天际线查询可提取非支配的有趣点,并帮助绘制有关数据的大图。它们在许多多准则决策应用程序中都很有价值,并且正在成为决策支持系统的重要组成部分。许多天际线算法通常做出的一个假设是,天际线查询应用于单个静态数据源或数据流。不幸的是,这种假设在许多应用中并不成立,在这些应用中,天际线查询可能涉及属于多个数据源的属性,并且需要在生成天际线之前执行联接操作。近来,已经提出了各种skyline-join算法以在静态数据源的情况下解决该问题。但是,这些算法有几个缺点:它们通常需要详尽地扫描数据源以获得“天际线合并”结果;此外,用于消除元组的修剪技术很大程度上基于昂贵的元组间比较。另一方面,大多数数据流技术专注于单流天际线查询,因此使其不适合天际线联接查询。大多数早期天际线算法通常做出的另一个假设是数据是完整的,并且所有天际线属性值都是可用。由于此约束,这些算法无法应用于不完整的数据源,在这些数据源中某些属性值丢失并且由NULL值表示。对于不完整的数据,存在主导的定义,但这会导致不良后果,例如非传递性和周期性的主导关系,这两种都不利于天际线处理。;基于上述观察,本文所述研究的主要目标论文是天际线运算符框架的设计和开发,可以有效处理三种不同类型的天际线查询:1)静态数据源上的天际线联接查询; 2)数据流上的天际线-窗口联接查询; 3)分层-对不完整数据集的天际线查询。本文提出了这些天际线查询所带来的独特挑战,并通过提出有效的方法来解决处理静态数据源,数据流和不完整数据集的天际线查询时增加的开销,从而解决了当前天际线技术的缺点。

著录项

  • 作者

    Nagendra, Mithila.;

  • 作者单位

    Arizona State University.;

  • 授予单位 Arizona State University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 232 p.
  • 总页数 232
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号