首页> 外文OA文献 >Elasca: Workload-Aware Elastic Scalability for Partition Based Database Systems
【2h】

Elasca: Workload-Aware Elastic Scalability for Partition Based Database Systems

机译:Elasca:基于分区的数据库系统的工作负载感知弹性可伸缩性

摘要

Providing the ability to increase or decrease allocated resources on demand as the transactional load varies is essential for database management systems (DBMS) deployed on today's computing platforms, such as the cloud. The need to maintain consistency of the database, at very large scales, while providing high performance and reliability makes elasticity particularly challenging. In this thesis, we exploit data partitioning as a way to provide elastic DBMS scalability. We assert that the flexibility provided by a partitioned, shared-nothing parallel DBMS can be used to implement elasticity. Our idea is to start with a small number of servers that manage all the partitions, and to elastically scale out by dynamically adding new servers and redistributing database partitions among these servers as the load varies. Implementing this approach requires (a) efficient mechanisms for addition/removal of servers and migration of partitions, and (b) policies to efficiently determine the optimal placement of partitions on the given servers as well as plans for partition migration.This thesis presents Elasca, a system that implements both these features in an existing shared-nothing DBMS (namely VoltDB) to provide automatic elastic scalability. Elasca consists of a mechanism for enabling elastic scalability, and a workload-aware optimizer for determining optimal partition placement and migration plans. Our optimizer minimizes computing resources required and balances load effectively without compromising system performance, even in the presence of variations in intensity and skew of the load. The results of our experiments show that Elasca is able to achieve performance close to a fully provisioned system while saving 35% resources on average. Furthermore, Elasca's workload-aware optimizer performs up to 79% less data movement than a greedy approach to resource minimization, and also balance load much more effectively.
机译:随着事务负载的变化,提供按需增加或减少分配的资源的能力对于部署在当今计算平台(例如云)上的数据库管理系统(DBMS)至关重要。在提供高性能和可靠性的同时,需要大规模维护数据库的一致性,这使弹性特别具有挑战性。在本文中,我们利用数据分区作为一种提供弹性DBMS可伸缩性的方法。我们断言,由无共享分区的并行DBMS提供的灵活性可用于实现弹性。我们的想法是从管理所有分区的少量服务器开始,并通过动态添加新服务器并在负载变化时在这些服务器之间重新分配数据库分区来进行弹性扩展。要实施此方法,需要(a)有效的服务器添加/删除和分区迁移机制,以及(b)有效地确定给定服务器上分区的最佳放置位置的策略以及分区迁移计划。本文介绍了Elasca,一个在现有的无共享DBMS(即VoltDB)中实现这两个功能的系统,以提供自动弹性可伸缩性。 Elasca包含一个用于实现弹性可伸缩性的机制,以及一个用于确定最佳分区放置和迁移计划的工作负载感知优化器。我们的优化器可以最大程度地减少所需的计算资源,并在不影响系统性能的情况下有效地平衡负载,即使存在强度和负载偏差的情况。我们的实验结果表明,Elasca能够实现接近完全配置的系统的性能,同时平均节省35%的资源。此外,与贪婪的资源最小化方法相比,Elasca的工作负载感知优化器最多可减少79%的数据移动,并且还可以更有效地平衡负载。

著录项

  • 作者

    Rafiq Taha;

  • 作者单位
  • 年度 2013
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号