首页> 外文期刊>Datenbank-Spektrum >Lock-free Data Structures for Data Stream Processing A Closer Look
【24h】

Lock-free Data Structures for Data Stream Processing A Closer Look

机译:用于数据流处理的无锁数据结构

获取原文
获取原文并翻译 | 示例
           

摘要

Processing data in real-time instead of storing and reading from tables has led to a specialization of DBMS into the so-called data stream processing paradigm. While high throughput and low latency are key requirements to keep up with varying stream behavior and to allow fast reaction to incoming events, there are many possibilities how to achieve them. In combination with modern hardware, like server CPUs with tens of cores, the parallelization of stream queries for multithreading and vectorization is a common schema. High degrees of parallelism, however, need efficient synchronization mechanisms to allow good scaling with threads for shared memory access.In this work, we identify the most time-consuming operations for stream processing exemplarily for our own stream processing engine PipeFabric. In addition, we present different design principles of lock-free data structures which are suited to overcome those bottlenecks. We will finally demonstrate how lock-freedom greatly improves performance for join processing and tuple exchange between operators under different workloads. Nevertheless, the efficient usage of lock-free data structures comes with additional efforts and pitfalls, which we also discuss in this paper.
机译:实时处理数据而不是存储和读取表已导致DBMS专门化为所谓的数据流处理范例。尽管高吞吐量和低延迟是紧跟变化的流行为并允许对传入事件做出快速反应的关键要求,但仍有许多实现方法。与现代硬件(例如具有数十个内核的服务器CPU)结合使用时,用于多线程和矢量化的流查询的并行化是一种常见的模式。但是,高度的并行性需要高效的同步机制,以允许线程具有良好的伸缩性,以便共享内存访问。在这项工作中,我们为自己的流处理引擎PipeFabric示例性地确定了流处理中最耗时的操作。此外,我们提出了适用于克服这些瓶颈的无锁数据结构的不同设计原理。最后,我们将展示无锁如何在不同的工作负载下极大地提高操作员之间的联接处理和元组交换性能。然而,无锁数据结构的有效使用需要付出额外的努力和陷阱,我们还将在本文中进行讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号