Incremental Query Processing on Big Data Streams

Leonidas Fegaras

Abstract

This paper addresses online query processing for large-scale, incremental data analysis on a distributed stream processing engine (DSPE). Our goal is to convert any SQL-like query to an incremental DSPE program automatically. In contrast to other approaches, we derive incremental programs that return accurate results, not approximate answers, by retaining a minimal state during the query evaluation lifetime and by using a novel incremental evaluation technique, which, at each time interval, returns an accurate snapshot answer that depends on the current state and the latest batches of data. Our methods can handle many forms of queries on nested data collections, including iterative and nested queries, group-by with aggregation, and equi-joins. Finally, we report on a prototype implementation of our framework, called MRQL Streaming, running on top of Spark and we experimentally validate the effectiveness of our methods.
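The idea of keeping a small state and producing an exact snapshot answer after each batch can be illustrated with a toy example. The sketch below is not the paper's algorithm and not the MRQL Streaming API. It assumes a single group-by query that computes a per-key average, and every name in it (`summarize`, `merge`, `snapshot`, `batches`) is hypothetical. The retained state is a (sum, count) pair per key, which is enough to recover the exact average without storing the stream.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical names throughout; this is an illustration, not MRQL Streaming.
State = Dict[str, Tuple[float, int]]  # key -> (running sum, running count)


def summarize(batch: Iterable[Tuple[str, float]]) -> State:
    """Reduce one batch of (key, value) records to per-key (sum, count)."""
    acc: Dict[str, Tuple[float, int]] = defaultdict(lambda: (0.0, 0))
    for key, value in batch:
        s, n = acc[key]
        acc[key] = (s + value, n + 1)
    return dict(acc)


def merge(state: State, delta: State) -> State:
    """Combine the retained state with the summary of the latest batch."""
    merged = dict(state)
    for key, (s, n) in delta.items():
        s0, n0 = merged.get(key, (0.0, 0))
        merged[key] = (s0 + s, n0 + n)
    return merged


def snapshot(state: State) -> Dict[str, float]:
    """Derive the exact per-key average from the state alone."""
    return {key: s / n for key, (s, n) in state.items()}


# Assumed input: three small batches arriving at successive time intervals.
batches: List[List[Tuple[str, float]]] = [
    [("a", 1.0), ("b", 4.0)],
    [("a", 3.0)],
    [("b", 2.0), ("c", 5.0)],
]

state: State = {}
answers: List[Dict[str, float]] = []
for batch in batches:
    state = merge(state, summarize(batch))  # only the state is retained
    answers.append(snapshot(state))         # exact answer at this interval
```

In this sketch the final snapshot equals what a one-shot query over all records seen so far would return, because sums and counts combine associatively. An average by itself would not work as state, since two averages cannot be merged without knowing their counts.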

Bibliographic record

Source: IEEE Transactions on Knowledge and Data Engineering, 2016, no. 11, pp. 2998-3012 (15 pages)
Author: Leonidas Fegaras
Affiliation: University of Texas at Arlington, Arlington, TX
Original format: PDF
Language: English
Keywords: Sparks; Query processing; Big data; Data analysis; Silicon; Digital signal processing; Database languages

Similar literature

1. Incremental Evaluation of Sliding-Window Queries over Data Streams [J]. Ghanem T.M., Hammad M.A., Mokbel M.F. IEEE Transactions on Knowledge and Data Engineering. 2007
2. Correction to: Semantic annotation of summarized sensor data stream for effective query processing [J]. Pacha Shobharani, Murugan Suresh Ramalingam, Sethukarasi R. Journal of Supercomputing. 2020, no. 6
3. Semantic annotation of summarized sensor data stream for effective query processing [J]. Pacha Shobharani, Murugan Suresh Ramalingam, Sethukarasi R. Journal of Supercomputing. 2020, no. 6
4. Incrementally-Updatable Stream Processors for XPath Queries based on Merging Automata via Ordered Hash-keys [C]. Takekawa, H., Ishikawa, Database and Expert Systems Applications (DEXA), 2007 18th International Conference on. 2007
5. Efficient Processing of Skyline Queries on Static Data Sources, Data Streams and Incomplete Datasets [D]. Nagendra, Mithila. 2014
6. Streaming chunk incremental learning for class-wise data stream classification with fast learning speed and low structural complexity [O]. Prem Junsawang, Suphakant Phimoltares, Chidchanok Lursinsap. 2012
7. In-Memory Based Incremental Processing Method for Stream Query Processing in Big Data Environments [O]. Kyoungsoo Bok, Misun Yook, Yeonwoo Noh, 2016
