首页> 外文学位 >Performance Prediction and Resource Bricolage for Database Systems.
【24h】

Performance Prediction and Resource Bricolage for Database Systems.

机译:数据库系统的性能预测和资源Bricolage。

获取原文
获取原文并翻译 | 示例

摘要

With the growth of the Internet, our ability to generate extremely large amounts of data has dramatically increased. This sheer volume of data that needs to be managed and analyzed has led to the wide adoption of very large and complex data management systems. Although these systems can significantly reduce data processing time, issues such as hardware/software skew, resource contention, and failures are more likely to arise. All large and complex systems have to face this unwanted but inevitable fact. Due to all these issues, it gets harder to anticipate the future state of a system, and a one-time decision model used by schedulers, optimizers or resource managers will be vulnerable to state changes.;Meanwhile, running parallel database systems in an environment with heterogeneous resources has become increasingly common, due to cluster evolution and increasing interest in moving applications into public clouds. Very large data processing is increasingly becoming a necessity for modern applications. For database systems running in a heterogeneous cluster, the default data partitioning strategy may overload some of the slow machine while at the same time it may under-utilize the more powerful machines. Since the processing time of a parallel query is determined by the slowest machine, such an allocation strategy may result in significant query performance degradation.;It is not uncommon today for us to decide which computing resources should be used to build a cluster or to run an application from a diverse range of such resources. Very often, when a new cluster is built or an old cluster is upgraded, there are various machines, low-end or high-end, that we can choose from. Different choices may lead to different costs or performance. Thus, we will encounter a resource selection problem if we have a limited budget or a performance goal.;This dissertation makes three contributions by addressing these three problems: query progress estimation, data allocation , and resource selection˙ The first contribution is the design and implementation of a new cost-based query progress indicator, called GSLPI, to produce more accurate progress estimates. The second contribution is a new technique we call resource bricolage that provides a recommended data partitioning scheme to minimize workload execution time in heterogeneous environments. The third contribution is the formalization and solutions for two resource bricolage problems with either a budget constraint or a time constraint. We show that the solution combining both data allocation and resource selection can achieve significant performance improvement over other alternatives.;This dissertation provides a new vision of deploying performance prediction technology in the areas of query optimization, scheduling, and execution, and it also points to promising directions for future studies to improve database performance running in the cloud.
机译:随着Internet的发展,我们生成大量数据的能力得到了极大提高。需要管理和分析的如此庞大的数据量导致了非常大型和复杂的数据管理系统的广泛采用。尽管这些系统可以大大减少数据处理时间,但更可能出现诸如硬件/软件偏斜,资源争用和故障之类的问题。所有大型和复杂的系统都必须面对这个不希望但不可避免的事实。由于所有这些问题,很难预测系统的未来状态,并且调度程序,优化程序或资源管理器使用的一次性决策模型将容易受到状态更改的影响;同时,在环境中运行并行数据库系统由于集群的发展以及对将应用程序迁移到公共云的兴趣日益浓厚,异构资源的使用已变得越来越普遍。对于现代应用程序来说,非常大的数据处理正变得越来越必要。对于在异构集群中运行的数据库系统,默认的数据分区策略可能会使某些速度较慢的计算机过载,同时又可能无法充分利用功能更强大的计算机。由于并行查询的处理时间是由最慢的机器决定的,因此这种分配策略可能会导致查询性能显着下降。今天,我们决定应使用哪些计算资源来构建集群或运行它已经很普遍。来自各种此类资源的应用程序。通常,在构建新集群或升级旧集群时,我们可以选择各种机器(低端或高端)。不同的选择可能导致不同的成本或性能。因此,如果预算或性能目标有限,就会遇到资源选择的问题。本文通过解决这三个问题,对查询进度估计,数据分配和资源选择提出了三点贡献。第一个贡献是设计和实现了一种新的基于成本的查询进度指示器,称为GSLPI,以产生更准确的进度估计。第二个贡献是一种称为资源桥接的新技术,该技术提供了一种推荐的数据分区方案,以最大程度地减少异构环境中的工作负载执行时间。第三个贡献是对两个预算约束或时间约束的资源短缺问题的形式化和解决方案。我们证明了将数据分配和资源选择相结合的解决方案可以比其他方法获得显着的性能提升。;本文为在查询优化,调度和执行领域中部署性能预测技术提供了新的视角,并且指出了未来研究的有希望的方向,以改善在云中运行的数据库性能。

著录项

  • 作者

    Li, Jiexing.;

  • 作者单位

    The University of Wisconsin - Madison.;

  • 授予单位 The University of Wisconsin - Madison.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 140 p.
  • 总页数 140
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号