Adaptive query processing in data stream management systems.

机译：数据流管理系统中的自适应查询处理。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many modern applications need to process data streams that consist of data elements generated in a continuous unbounded fashion. Examples include network monitoring, financial monitoring over stock tickers, sensor processing for environmental monitoring or inventory tracking, telecommunications fraud detection, and others. These applications have spurred interest in a new class of systems, called Data Stream Management Systems (DSMSs), that enable applications to pose long-running continuous queries over data streams.; A fundamental challenge faced by DSMSs is that stream conditions (e.g., data distribution, arrival rate) and system conditions (e.g., query load, memory availability) may vary significantly over the lifetime of a continuous query. When stream or system conditions change, a query execution strategy that was efficient before the change may become very inefficient. Consequently, it is important for a DSMS to support adaptive query processing: The DSMS must be prepared to change the execution plan for a continuous query while the query is running, based on how stream and system conditions change. Without adaptivity, plan performance may drop drastically over time.; This thesis presents a generic framework, called StreaMon, for adaptive query processing in a DSMS. StreaMon has three core components: (i) An Executor, which runs the current plan for each query, (ii) a Profiler, which collects and maintains statistics about current stream and system conditions, and (iii) a Re-optimizer, which ensures that the current plans are the most efficient for current conditions. We instantiate the generic StreaMon framework for three distinct combinations of continuous query type and adaptivity need: (1) Adaptive processing of commutative filters over a stream to maximize throughput at all points in time. (2) Adaptive placement of subresult caches in pipelined plans for windowed stream joins to maximize throughput at all points in time. (3) Detecting relaxed constraints automatically in input streams and exploiting these constraints to reduce memory requirements in plans for windowed stream joins. For each problem, we provide the definition and motivating examples, develop and analyze adaptive algorithms, and present implementation techniques and experimental results from the STREAM general-purpose DSMS prototype developed at Stanford.

机译：许多现代应用程序需要处理数据流，这些数据流包含以连续无限制方式生成的数据元素。示例包括网络监视，股票行情指示器的财务监视，用于环境监视或库存跟踪的传感器处理，电信欺诈检测等。这些应用引起了人们对新型系统的关注，该系统称为数据流管理系统（DSMS），该系统使应用能够对数据流进行长时间连续的查询。 DSMS面临的基本挑战是，在连续查询的整个生命周期内，流条件（例如，数据分布，到达率）和系统条件（例如，查询负载，内存可用性）可能会发生很大的变化。当流或系统条件发生更改时，在更改之前有效的查询执行策略可能会变得非常低效。因此，DSMS必须支持自适应查询处理，这一点很重要：必须准备DSMS，以根据查询流和系统条件的变化，在查询运行时更改连续查询的执行计划。没有适应性，计划绩效可能会随着时间的流逝而急剧下降。本文提出了一个通用框架，称为StreaMon，用于DSMS中的自适应查询处理。 StreaMon具有三个核心组件：（i）执行程序，为每个查询运行当前计划；（ii）事件探查器，收集并维护有关当前流和系统状况的统计信息；以及（iii）重新优化器，用于确保当前计划对于当前条件是最有效的。我们针对连续查询类型和适应性需求的三种不同组合实例化通用StreaMon框架：（1）对流中的交换过滤器进行自适应处理，以最大化所有时间点的吞吐量。（2）在窗口流连接的流水线计划中自适应放置子结果缓存，以在所有时间点最大化吞吐量。（3）自动检测输入流中的宽松约束，并利用这些约束来减少窗口流连接计划中的内存需求。对于每个问题，我们提供定义和激励示例，开发和分析自适应算法，并提供斯坦福大学开发的STREAM通用DSMS原型的实现技术和实验结果。

著录项

作者
Babu, Shivnath.;
展开▼
作者单位

Stanford University.;

展开▼
授予单位 Stanford University.;
学科 Computer Science.
学位 Ph.D.
年度 2005
页码 220 p.
总页数 220
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Adaptive mechanism for distributed query processing and data loading using the RDF data in the cloud [J] . Dharmaraj Chandrasekaran Ranichandra, Tripathy BalaKrushna International journal of communication systems . 2018,第15期

机译：使用云中的RDF数据进行分布式查询处理和数据加载的自适应机制
2. Data stream management: processing high-speed data streams. [J] . Paparao Kavalipati Computing reviews . 2017,第3期

机译：数据流管理：处理高速数据流。
3. Time-Stamp Management and Query Execution in Data Stream Management Systems [J] . Yijian Bai, Thakkar H., Haixun Wang, IEEE internet computing . 2008,第6期

机译：数据流管理系统中的时间戳管理和查询执行
4. Adaptive Query Processing in Data Stream Management Systems under Limited Memory Resources [C] . Fatima Farag, Moustafa Hammad, Reda AlHajj 3rd PhD workshop on information and knowledge management 2010 . 2010

机译：有限内存资源下数据流管理系统中的自适应查询处理
5. QoS management for real-time data services and data stream query processing. [D] . Wei, Yuan. 2006

机译：QoS管理，用于实时数据服务和数据流查询处理。
6. An Adaptive Parallel Processing Strategy for Complex Event Processing Systems over Data Streams in Wireless Sensor Networks [O] . Fuyuan Xiao, Masayoshi Aritsugi 2018

机译：无线传感器网络中数据流上复杂事件处理系统的自适应并行处理策略
7. Real-Time, Load-Adaptive Processing of Continuous Queries Over Data Streams [O] . Dhananjay Kulkarni, Chinya V. Ravishankar, Mitch Cherniack 2012

机译：数据流中连续查询的实时，负载自适应处理

Adaptive query processing in data stream management systems.

摘要

著录项

相似文献

相关主题

期刊订阅