首页> 外文会议>International conference on management of data >Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems
【24h】

Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems

机译:Skew-Invustn自动数据库分区在共享无线,并行OLTP系统中

获取原文

摘要

The advent of affordable, shared-nothing computing systems portends a new class of parallel database management systems (DBMS) for on-line transaction processing (OLTP) applications that scale without sacrificing ACID guarantees [7, 9]. The performance of these DBMSs is predicated on the existence of an optimal database design that is tailored for the unique characteristics of OLTP workloads [43]. Deriving such designs for modern DBMSs is difficult, especially for enterprise-class OLTP systems, since they impose extra challenges: the use of stored procedures, the need for load balancing in the presence of time-varying skew, complex schemas. and deployments with larger number of partitions. To this purpose, we present a novel approach to automatically partitioning databases for enterprise-class OLTP systems that significantly extends the state of the art by: (1) minimizing the number distributed transactions, while concurrently mitigating the effects of temporal skew in both the data distribution and accesses, (2) extending the design space to include replicated secondary indexes. (4) organically handling stored procedure routing, and (3) scaling of schema complexity, data size, and number of partitions. This effort builds on two key technical contributions: an analytical cost model that can be used to quickly estimate the relative coordination cost and skew for a given workload and a candidate database design, and an informed exploration of the huge solution space based on large neighborhood search. To evaluate our methods, we integrated our database design tool with a high-performance parallel, main memory DBMS and compared our methods against both popular heuristics and a state-of-the-art research prototype [17]. Using a diverse set of benchmarks, we show that our approach improves throughput by up to a factor of 16 × over these other approaches.
机译:经济实惠的,共享除了计算系统的出现,为在线交易处理(OLTP)应用程序的新类并行数据库管理系统(DBMS),其在不牺牲酸保证的情况下进行比例[7,9]。这些DBMS的性能取决于存在于为OLTP工作负载的独特特征量身定制的最佳数据库设计[43]。导出现代DBMSS的这种设计很困难,特别是对于企业级的OLTP系统,因为它们施加了额外的挑战:使用存储过程,在存在时变偏差的情况下,需要负载平衡。和部署具有较大分区。为此目的,我们提出了一种新的方法来自动分区企业级OLTP系统的数据库,该系统显着扩展了最新状态:(1)最小化数字分布式事务,同时兼容数据偏斜在数据中的时间偏斜的影响分发和访问,(2)扩展设计空间以包括复制的辅助索引。 (4)有机处理存储过程路由,以及(3)模式复杂性,数据大小和分区数量的缩放。这项努力构建了两个关键的技术贡献:一个分析成本模型,可用于快速估计给定工作量和候选数据库设计的相对协调成本和偏差,以及基于大街区搜索的巨大解决方案空间的明智探索。为了评估我们的方法,我们将数据库设计工具与高性能并行,主内存DBMS集成,并将我们的方法与流行的启发式和最先进的研究原型进行了比较[17]。使用各种基准测试,我们表明我们的方法通过这些其他方法提高了吞吐量达到16倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号