首页> 外文期刊>The VLDB journal >Resource bricolage and resource selection for parallel database systems
【24h】

Resource bricolage and resource selection for parallel database systems

机译:并行数据库系统的资源权限和资源选择

获取原文
获取原文并翻译 | 示例
       

摘要

Running parallel database systems in an environment with heterogeneous resources has become increasingly common, due to cluster evolution and increasing interest in moving applications into public clouds. Performance differences among machines in the same cluster pose new challenges for parallel database systems. First, for database systems running in a heterogeneous cluster, the default uniform data partitioning strategy may overload some of the slow machines, while at the same time it may underutilize the more powerful machines. Since the processing time of a parallel query is determined by the slowest machine, such an allocation strategy may result in a significant query performance degradation. Second, since machines might have varying resources or performance, different choices of machines may lead to different costs or performance for executing the same workload. By carefully selecting the most suitable machines for running a workload, we may achieve better performance with the same budget, or we may meet the same performance requirements with a lower cost. We address these challenges by introducing techniques we call resource bricolage and resource selection that improve database performance in heterogeneous environments. Our approaches quantify the performance differences among machines with various resources as they process workloads with diverse resource requirements. For the purpose of better resource utilization, we formalize the problem of minimizing workload execution time and view it as an optimization problem, and then, we employ linear programming to obtain a recommended data partitioning scheme. For the purpose of better resource selection, we formalize two problems: One minimizes the total workload execution time with a given budget, and the other minimizes the total budget with a given performance target. We then employ different mixed-integer programs to search for the optimal resource selection decisions. We verify the effectiveness of both resource bricolage and resource selection techniques with an extensive experimental study.
机译:由于集群的发展以及对将应用程序迁移到公共云中的兴趣日益浓厚,在具有异构资源的环境中运行并行数据库系统已变得越来越普遍。同一集群中机器之间的性能差异对并行数据库系统提出了新的挑战。首先,对于在异构集群中运行的数据库系统,默认的统一数据分区策略可能会使某些速度较慢的计算机过载,而同时又可能无法充分利用功能更强大的计算机。由于并行查询的处理时间是由最慢的机器确定的,因此这种分配策略可能会导致查询性能显着下降。其次,由于机器可能具有不同的资源或性能,因此不同的机器选择可能会导致执行相同工作负载的成本或性能有所不同。通过仔细选择最适合运行工作负载的机器,我们可以在相同的预算下获得更好的性能,或者我们可以以更低的成本满足相同的性能要求。我们通过引入称为资源桥接和资源选择的技术来解决这些挑战,这些技术可以提高异构环境中的数据库性能。我们的方法可以量化具有各种资源的计算机在处理具有各种资源需求的工作负载时的性能差异。为了更好地利用资源,我们将最小化工作负载执行时间的问题形式化,并将其视为优化问题,然后,我们采用线性编程来获得推荐的数据分区方案。为了更好地选择资源,我们对两个问题进行了形式化处理:一个以给定的预算最小化总工作量执行时间,另一个以给定的性能目标最小化总预算。然后,我们采用不同的混合整数程序来搜索最佳资源选择决策。我们通过广泛的实验研究验证了资源筛选和资源选择技术的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号