Design and Evaluation of an RDMA-aware Data Shuffling Operator for Parallel Database Systems

Abstract

The commoditization of high-performance networking has sparked research interest in the RDMA capability of this hardware. One-sided RDMA primitives, in particular, have generated substantial excitement due to the ability to directly access remote memory from within an application without involving the TCP/IP stack or the remote CPU. This article considers how to leverage RDMA to improve the analytical performance of parallel database systems. To shuffle data efficiently using RDMA, one needs to consider a complex design space that includes (1) the number of open connections, (2) the contention for the shared network interface, (3) the RDMA transport function, and (4) how much memory should be reserved to exchange data between nodes during query processing. We contribute eight designs that capture salient tradeoffs in this design space as well as an adaptive algorithm to dynamically manage RDMA-registered memory. We comprehensively evaluate how transport-layer decisions impact the query performance of a database system for different generations of InfiniBand. We find that a shuffling operator that uses the RDMA Send/Receive transport function over the Unreliable Datagram transport service can transmit data up to 4x faster than an RDMA-capable MPI implementation in a 16-node cluster. The response time of TPC-H queries improves by as much as 2x.

Bibliographic details

  • Source
    ACM Transactions on Database Systems | 2019, Issue 4 | pp. 17.1-17.45 | 45 pages
  • Author affiliations

    Ohio State Univ Columbus OH 43210 USA|395 Dreese Labs 2015 Neil Ave Columbus OH 43210 USA;

    Ohio State Univ Columbus OH 43210 USA|395 Dreese Labs 2015 Neil Ave Columbus OH 43210 USA;

    Ohio State Univ Columbus OH 43210 USA|395 Dreese Labs 2015 Neil Ave Columbus OH 43210 USA;

  • Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
  • Original format: PDF
  • Language: English
  • CLC classification
  • Keywords

    Data shuffling; RDMA; parallel database systems;

  • Date added: 2022-08-18 21:21:04
