首页> 外文OA文献 >High-Speed Query Processing over High-Speed Networks
【2h】

High-Speed Query Processing over High-Speed Networks

机译:高速网络上的高速查询处理

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Modern database clusters entail two levels of networks: connecting CPUs andNUMA regions inside a single server in the small and multiple servers in thelarge. The huge performance gap between these two types of networks used toslow down distributed query processing to such an extent that a cluster ofmachines actually performed worse than a single many-core server. The increasedmain-memory capacity of the cluster remained the sole benefit of such ascale-out. The economic viability of high-speed interconnects such as InfiniBand hasnarrowed this performance gap considerably. However, InfiniBand's highernetwork bandwidth alone does not improve query performance as expected when thedistributed query engine is left unchanged. The scalability of distributedquery processing is impaired by TCP overheads, switch contention due touncoordinated communication, and load imbalances resulting from theinflexibility of the classic exchange operator model. This paper presents theblueprint for a distributed query engine that addresses these problems byconsidering both levels of networks holistically. It consists of two parts:First, hybrid parallelism that distinguishes local and distributed parallelismfor better scalability in both the number of cores as well as servers. Second,a novel communication multiplexer tailored for analytical database workloadsusing remote direct memory access (RDMA) and low-latency network scheduling forhigh-speed communication with almost no CPU overhead. An extensive evaluationwithin the HyPer database system using the TPC-H benchmark shows that ourholistic approach indeed enables high-speed query processing over high-speednetworks.
机译:现代数据库集群需要两个级别的网络:在小型服务器中的单个服务器和大型服务器中的多个服务器之间连接CPU和NUMA区域。这两种类型的网络之间巨大的性能差距曾使分布式查询处理速度减慢到一定程度,以致一簇机器实际上比一台多核服务器的性能差。集群增加的主内存容量仍然是这种横向扩展的唯一好处。诸如InfiniBand之类的高速互连的经济可行性大大缩小了这种性能差距。但是,当分布式查询引擎保持不变时,仅InfiniBand的较高网络带宽并不能提高查询性能。 TCP开销,由于通信不协调而引起的交换机争用以及经典交换运算符模型的不灵活性导致的负载不平衡会削弱分布式查询处理的可伸缩性。本文提出了一种分布式查询引擎的蓝图,该引擎通过全面考虑网络的两个级别来解决这些问题。它由两部分组成:第一,混合并行机制,区分本地并行机制和分布式并行机制,以在内核和服务器数量上实现更好的可伸缩性。其次,一种新颖的通信多路复用器专为分析数据库的工作量身定制,它使用远程直接内存访问(RDMA)和低延迟网络调度来实现几乎没有CPU开销的高速通信。在使用TPC-H基准的HyPer数据库系统中进行的广泛评估表明,我们的整体方法确实可以在高速网络上实现高速查询处理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号