【24h】

Stream Operators for Querying Data Streams

机译:查询数据流的流运算符

获取原文
获取原文并翻译 | 示例

摘要

One of the most important uses of aggregate queries over data streams is sampling. Typically, aggregation is performed over sliding windows where queries return new results whenever the window contents change, a concept referred to as a continuous query. Existing data models and query languages for streams are not capable of expressing many practical user-defined samplings over streams. To this end we propose a new data stream model, referred to as the sequence model, and a query language for specifying aggregate queries over data streams. We show that the sequence model can readily express a superset of the aggregate queries expressible in the previously proposed time-based data stream model, thus providing a declarative and formal semantics to understand and reason about continuous aggregate queries. Defined on top of the sequence model, our query language supports existing sliding window operators and a novel frequency operator. By using the frequency operator one is capable of expressing useful sampling queries, such as queries with user-defined group-based sampling and nested aggregation over either the input stream or the result stream. Such capabilities are beyond those of previously proposed query languages over streams. Finally, we conduct a preliminary experimental study that shows our language is effective and efficient in practice.
机译:数据流上聚合查询的最重要用途之一是采样。通常,聚合是在滑动窗口上执行的,每当窗口内容更改时,查询都会返回新结果,这一概念称为连续查询。流的现有数据模型和查询语言无法表达许多实际的用户定义的流采样。为此,我们提出了一种新的数据流模型(称为序列模型)和一种查询语言,用于指定对数据流的聚合查询。我们表明,序列模型可以很容易地表达聚合查询的超集,该聚合查询可以在以前提出的基于时间的数据流模型中表达,从而提供了一种声明性和形式语义,以理解和推理关于连续聚合查询的原因。在序列模型的顶部定义,我们的查询语言支持现有的滑动窗口运算符和新颖的频率运算符。通过使用频率运算符,可以表达有用的采样查询,例如具有用户定义的基于组的采样的查询以及在输入流或结果流上的嵌套聚合。这样的能力超出了先前提出的流查询语言的能力。最后,我们进行了初步的实验研究,表明我们的语言在实践中是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号