Venue: IEEE International Parallel and Distributed Processing Symposium

A High-Performance Distributed Relational Database System for Scalable OLAP Processing

Abstract

The scalability of systems such as Hive and Spark SQL that are built on top of big data platforms has enabled query processing over very large data sets. However, the per-node performance of these systems is typically low compared to traditional relational databases. Conversely, Massively Parallel Processing (MPP) databases do not scale as well as these systems. We present HRDBMS, a fully implemented distributed shared-nothing relational database developed with the goal of improving the scalability of OLAP queries. HRDBMS achieves high scalability through a principled combination of techniques from relational and big data systems with novel communication and work-distribution techniques. While we also support serializable transactions, the system has not been optimized for this use case. HRDBMS runs on a custom distributed and asynchronous execution engine that was built from the ground up to support highly parallelized operator implementations. Our experimental comparison with Hive, Spark SQL, and Greenplum confirms that HRDBMS's scalability is on par with Hive and Spark SQL (up to 96 nodes) while its per-node performance can compete with MPP databases like Greenplum.
