首页> 外文学位 >Sliding window query processing over data streams.
【24h】

Sliding window query processing over data streams.

机译:在数据流上进行滑动窗口查询处理。

获取原文
获取原文并翻译 | 示例

摘要

Database management systems (DBMSs) have been used successfully in traditional business applications that require persistent data storage and an efficient querying mechanism. Typically, it is assumed that the data are static, unless explicitly modified or deleted by a user or application. Database queries are executed when issued and their answers reflect the current state of the data. However, emerging applications, such as sensor networks, real-time Internet traffic analysis, and on-line financial trading, require support for processing of unbounded data streams. The fundamental assumption of a data stream management system (DSMS) is that new data are generated continually, making it infeasible to store a stream in its entirety. At best, a sliding window of recently arrived data may be maintained, meaning that old data must be removed as time goes on. Furthermore, as the contents of the sliding windows evolve over time, it makes sense for users to ask a query once and receive updated answers over time.; This dissertation begins with the observation that the two fundamental requirements of a DSMS are dealing with transient (time-evolving) rather than static data and answering persistent rather than transient queries. One implication of the first requirement is that data maintenance costs have a significant effect on the performance of a DSMS. Additionally, traditional query processing algorithms must be re-engineered for the sliding window model because queries may need to re-process expired data and "undo" previously generated results. The second requirement suggests that a DSMS may execute a large number of persistent queries at the same time, therefore there exist opportunities for resource sharing among similar queries.; The purpose of this dissertation is to develop solutions for efficient query processing over sliding windows by focusing on these two fundamental properties. In terms of the transient nature of streaming data, this dissertation is based upon the following insight. Although the data keep changing over time as the windows slide forward, the changes are not random; on the contrary, the inputs and outputs of a DSMS exhibit patterns in the way the data are inserted and deleted. It will be shown that the knowledge of these patterns leads to an understanding of the semantics of persistent queries, lower window maintenance costs, as well as novel query processing, query optimization, and concurrency control strategies. In the context of the persistent nature of DSMS queries, the insight behind the proposed solution is that various queries may need to be refreshed at different times, therefore synchronizing the refresh schedules of similar queries creates more opportunities for resource sharing.
机译:数据库管理系统(DBMS)已成功用于需要持久数据存储和有效查询机制的传统业务应用程序中。通常,假定数据是静态的,除非由用户或应用程序明确修改或删除。数据库查询在发出时执行,其答案反映了数据的当前状态。但是,新兴的应用程序,例如传感器网络,实时Internet流量分析和在线金融交易,需要支持对无限制数据流的处理。数据流管理系统(DSMS)的基本假设是不断生成新数据,因此无法完全存储流。充其量,可以保持最近到达的数据的滑动窗口,这意味着随着时间的流逝必须删除旧数据。此外,随着滑动窗口的内容随时间变化,用户一次查询一次并随时间接收更新的答案就变得很有意义。本文首先观察到DSMS的两个基本要求是处理瞬态(随时间变化)而不是静态数据,并回答持久性查询而不是瞬态查询。第一项要求的一个含义是,数据维护成本会对DSMS的性能产生重大影响。另外,必须为滑动窗口模型重新设计传统的查询处理算法,因为查询可能需要重新处理过期的数据并“撤消”先前生成的结果。第二个要求表明DSMS可以同时执行大量持久查询,因此存在在类似查询之间共享资源的机会。本文的目的是通过着眼于这两个基本特性,为滑动窗口上的高效查询处理开发解决方案。就流数据的瞬时性质而言,本论文基于以下见解。尽管随着窗口向前滑动,数据会随着时间的推移而不断变化,但变化并不是随机的。相反,DSMS的输入和输出表现出插入和删除数据的方式。将显示,对这些模式的了解导致对持久查询的语义的理解,较低的窗口维护成本以及新颖的查询处理,查询优化和并发控制策略。在DSMS查询具有持久性的情况下,提出的解决方案背后的见解是可能需要在不同的时间刷新各种查询,因此,同步类似查询的刷新计划会为资源共享创造更多机会。

著录项

  • 作者

    Golab, Lukasz.;

  • 作者单位

    University of Waterloo (Canada).;

  • 授予单位 University of Waterloo (Canada).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2006
  • 页码 180 p.
  • 总页数 180
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号