首页> 外文学位 >Adaptive query processing in data stream management systems.
【24h】

Adaptive query processing in data stream management systems.

机译:数据流管理系统中的自适应查询处理。

获取原文
获取原文并翻译 | 示例

摘要

Many modern applications need to process data streams that consist of data elements generated in a continuous unbounded fashion. Examples include network monitoring, financial monitoring over stock tickers, sensor processing for environmental monitoring or inventory tracking, telecommunications fraud detection, and others. These applications have spurred interest in a new class of systems, called Data Stream Management Systems (DSMSs), that enable applications to pose long-running continuous queries over data streams.; A fundamental challenge faced by DSMSs is that stream conditions (e.g., data distribution, arrival rate) and system conditions (e.g., query load, memory availability) may vary significantly over the lifetime of a continuous query. When stream or system conditions change, a query execution strategy that was efficient before the change may become very inefficient. Consequently, it is important for a DSMS to support adaptive query processing: The DSMS must be prepared to change the execution plan for a continuous query while the query is running, based on how stream and system conditions change. Without adaptivity, plan performance may drop drastically over time.; This thesis presents a generic framework, called StreaMon, for adaptive query processing in a DSMS. StreaMon has three core components: (i) An Executor, which runs the current plan for each query, (ii) a Profiler, which collects and maintains statistics about current stream and system conditions, and (iii) a Re-optimizer, which ensures that the current plans are the most efficient for current conditions. We instantiate the generic StreaMon framework for three distinct combinations of continuous query type and adaptivity need: (1) Adaptive processing of commutative filters over a stream to maximize throughput at all points in time. (2) Adaptive placement of subresult caches in pipelined plans for windowed stream joins to maximize throughput at all points in time. (3) Detecting relaxed constraints automatically in input streams and exploiting these constraints to reduce memory requirements in plans for windowed stream joins. For each problem, we provide the definition and motivating examples, develop and analyze adaptive algorithms, and present implementation techniques and experimental results from the STREAM general-purpose DSMS prototype developed at Stanford.
机译:许多现代应用程序需要处理数据流,这些数据流包含以连续无限制方式生成的数据元素。示例包括网络监视,股票行情指示器的财务监视,用于环境监视或库存跟踪的传感器处理,电信欺诈检测等。这些应用引起了人们对新型系统的关注,该系统称为数据流管理系统(DSMS),该系统使应用能够对数据流进行长时间连续的查询。 DSMS面临的基本挑战是,在连续查询的整个生命周期内,流条件(例如,数据分布,到达率)和系统条件(例如,查询负载,内存可用性)可能会发生很大的变化。当流或系统条件发生更改时,在更改之前有效的查询执行策略可能会变得非常低效。因此,DSMS必须支持自适应查询处理,这一点很重要:必须准备DSMS,以根据查询流和系统条件的变化,在查询运行时更改连续查询的执行计划。没有适应性,计划绩效可能会随着时间的流逝而急剧下降。本文提出了一个通用框架,称为StreaMon,用于DSMS中的自适应查询处理。 StreaMon具有三个核心组件:(i)执行程序,为每个查询运行当前计划;(ii)事件探查器,收集并维护有关当前流和系统状况的统计信息;以及(iii)重新优化器,用于确保当前计划对于当前条件是最有效的。我们针对连续查询类型和适应性需求的三种不同组合实例化通用StreaMon框架:(1)对流中的交换过滤器进行自适应处理,以最大化所有时间点的吞吐量。 (2)在窗口流连接的流水线计划中自适应放置子结果缓存,以在所有时间点最大化吞吐量。 (3)自动检测输入流中的宽松约束,并利用这些约束来减少窗口流连接计划中的内存需求。对于每个问题,我们提供定义和激励示例,开发和分析自适应算法,并提供斯坦福大学开发的STREAM通用DSMS原型的实现技术和实验结果。

著录项

  • 作者

    Babu, Shivnath.;

  • 作者单位

    Stanford University.;

  • 授予单位 Stanford University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 220 p.
  • 总页数 220
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号