首页> 外文会议>European Conference on Parallel Processing >Parallelization of Massive Multiway Stream Joins on Manycore CPUs
【24h】

Parallelization of Massive Multiway Stream Joins on Manycore CPUs

机译:大规模多路流联接的并行化在Manycore CPU上

获取原文

摘要

Joining a high number of data streams efficiently in terms of required memory and CPU time still poses a challenge. While binary join trees are very common in database systems, they are mostly unusable for streaming queries with tight latency constraints when the number of streaming sources is increasing. Multiway stream joins, on the other hand, are very suitable for this task since they are mostly independent of the non-optimal ordering of join operators or huge intermediate join results. In this paper, we discuss challenges but also opportunities for multi-way stream joins for modern hardware, especially manycore processors. We describe different parallelization and optimization strategies to allow a streaming query to join up to 256 streams on a single CPU while keeping individual tuple response time and also memory footprint low. Our results show that a multiway join can perform magnitudes faster than a binary join tree. In addition, further tuning for efficient parallelism can improve performance again for a factor up to a magnitude.
机译:就所需的内存和CPU时间而言,有效地连接大量数据流仍然是一个挑战。尽管二进制连接树在数据库系统中非常常见,但是当流源数量不断增加时,它们对于延迟严格的流查询几乎不可用。另一方面,多路流连接非常适合此任务,因为它们主要独立于连接运算符的非最佳排序或庞大的中间连接结果。在本文中,我们讨论了挑战,但也讨论了现代硬件(尤其是许多核心处理器)进行多路流连接的机会。我们描述了不同的并行化和优化策略,以允许流查询在单个CPU上加入多达256个流,同时保持单个元组的响应时间和较低的内存占用。我们的结果表明,多路联接的执行幅度比二进制联接树的执行速度快。此外,为有效的并行性而进行的进一步调整可以将性能再次提高一个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号