International Conference on Very Large Data Bases (VLDB 2008)
Out-of-Order Processing: A New Architecture for High-Performance Stream Systems

Abstract

Many stream-processing systems enforce an order on data streams during query evaluation to help unblock blocking operators and purge state from stateful operators. Such in-order processing (IOP) systems not only must enforce order on input streams, but also require that query operators preserve order. This order-preserving requirement constrains the implementation of stream systems and incurs significant performance penalties, particularly for memory consumption. Especially for high-performance, potentially distributed stream systems, the cost of enforcing order can be prohibitive. We introduce a new architecture for stream systems, out-of-order processing (OOP), that avoids ordering constraints. The OOP architecture frees stream systems from the burden of order maintenance by using explicit stream progress indicators, such as punctuation or heartbeats, to unblock and purge operators. We describe the implementation of OOP stream systems and discuss the benefits of this architecture in depth. For example, the OOP approach has proven useful for smoothing workload bursts caused by expensive end-of-window operations, which can overwhelm internal communication paths in IOP approaches. We have implemented OOP in two stream systems, Gigascope and NiagaraST. Our experimental study shows that the OOP approach can significantly outperform IOP in a number of aspects, including memory, throughput and latency.
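To make the role of progress indicators concrete, here is a minimal sketch of a windowed count that accepts tuples in any order and relies on punctuation alone to emit results and purge state. It is an illustration under assumptions, not the Gigascope or NiagaraST implementation: the class `OOPWindowCount`, its methods `on_tuple` and `on_punctuation`, the tumbling-window scheme, and the integer timestamps are all hypothetical names and choices made for this example.

```python
from collections import defaultdict


class OOPWindowCount:
    """Hypothetical tumbling-window count that accepts tuples in any order."""

    def __init__(self, window_size):
        self.window_size = window_size
        self.counts = defaultdict(int)  # window id -> tuples seen so far

    def on_tuple(self, timestamp):
        # No sorting or reorder buffer: update the partial result in place.
        self.counts[timestamp // self.window_size] += 1

    def on_punctuation(self, bound):
        """Handle a promise that no later tuple has timestamp < bound.

        Returns (window_start, count) for every window that is now complete
        and purges the state held for those windows.
        """
        closed = sorted(w for w in self.counts
                        if (w + 1) * self.window_size <= bound)
        return [(w * self.window_size, self.counts.pop(w)) for w in closed]


op = OOPWindowCount(window_size=10)
for ts in [12, 3, 15, 7, 21, 9]:  # arrival order differs from timestamp order
    op.on_tuple(ts)
results = op.on_punctuation(20)   # windows [0, 10) and [10, 20) are complete
print(results)
print(dict(op.counts))            # state for the still-open window [20, 30)
```

In this sketch the operator never reorders its input. The punctuation is the only signal that unblocks output and frees memory, which is the property the abstract attributes to the OOP architecture.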