High-Speed Query Processing over High-Speed Networks

Abstract

Modern database clusters entail two levels of networks: connecting CPUs and NUMA regions inside a single server in the small and multiple servers in the large. The huge performance gap between these two types of networks used to slow down distributed query processing to such an extent that a cluster of machines actually performed worse than a single many-core server. The increased main-memory capacity of the cluster remained the sole benefit of such a scale-out. The economic viability of high-speed interconnects such as InfiniBand has narrowed this performance gap considerably. However, InfiniBand's higher network bandwidth alone does not improve query performance as expected when the distributed query engine is left unchanged. The scalability of distributed query processing is impaired by TCP overheads, switch contention due to uncoordinated communication, and load imbalances resulting from the inflexibility of the classic exchange operator model. This paper presents the blueprint for a distributed query engine that addresses these problems by considering both levels of networks holistically. It consists of two parts: First, hybrid parallelism that distinguishes local and distributed parallelism for better scalability in both the number of cores as well as servers. Second, a novel communication multiplexer tailored for analytical database workloads using remote direct memory access (RDMA) and low-latency network scheduling for high-speed communication with almost no CPU overhead. An extensive evaluation within the HyPer database system using the TPC-H benchmark shows that our holistic approach indeed enables high-speed query processing over high-speed networks.
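
The abstract names switch contention from uncoordinated communication as a scalability problem but does not spell out the scheduling scheme. As a minimal sketch of one common way to coordinate an all-to-all shuffle, assuming a simple round-robin schedule (the function name `round_robin_schedule` and the phase layout are illustrative assumptions, not taken from the paper), each phase pairs every sender with a distinct receiver so that no server receives from two senders at once:

```python
def round_robin_schedule(num_servers):
    """Build a contention-free all-to-all schedule (illustrative assumption).

    Returns a list of phases. Each phase is a dict mapping sender -> receiver.
    In phase `shift`, server i sends to server (i + shift) % num_servers, so
    every server receives from exactly one sender per phase.
    """
    if num_servers < 2:
        return []
    phases = []
    for shift in range(1, num_servers):
        phase = {sender: (sender + shift) % num_servers
                 for sender in range(num_servers)}
        phases.append(phase)
    return phases


if __name__ == "__main__":
    # Print the schedule for a hypothetical 4-server cluster.
    for phase_no, phase in enumerate(round_robin_schedule(4), start=1):
        print(f"phase {phase_no}: {phase}")
```

With n servers the schedule has n - 1 phases and covers every ordered sender/receiver pair exactly once. A real engine would additionally need to synchronize phase changes across servers, which this sketch leaves out.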


Bibliographic details

Authors: Roediger, Wolf; Muehlbauer, Tobias; Kemper, Alfons; Neumann, Thomas
Year: 2015
Original format: PDF

Similar documents

1. A Design of High-speed Big Data Query Processing System for Social Data Analysis: Using Spark SQL [J] . Kiejin Park, Limei Peng International Journal of Applied Engineering Research . 2016, Issue 14aPta2
2. Approximate processing of massive continuous quantile queries over high-speed data streams [J] . Lin X., Xu J., Zhang Q., IEEE Transactions on Knowledge and Data Engineering . 2006, Issue 5
3. A Novel Network Processor for Security Applications in High-Speed Data Networks [J] . Kyriakos G. Vlachos Bell Labs technical journal . 2003, Issue 1
4. High-Speed Query Processing over High-Speed Networks [C] . Wolf Roediger, Tobias Muehlbauer, Alfons Kemper, International conference on very large data bases . 2016
5. Advanced Signal Processing for High-speed Optical Wireless Communications and Networking [D] . Hong, Yang 2018
6. Dipping Process Characteristics Based on Image Processing of Pictures Captured by High-speed Cameras [O] . Junhui Li, Yang Xia, Wei Wang, -1
7. Approximate processing of massive continuous quantile queries over high-speed data streams [O] . Xuemin Lin, Jian Xu, Qing Zhang, 2006
