首页> 外文期刊>ACM Transactions on Modeling and Performance Evaluation of Computing Systems >Adaptive Performance Modeling of Data-intensive Workloads for Resource Provisioning in Virtualized Environment
【24h】

Adaptive Performance Modeling of Data-intensive Workloads for Resource Provisioning in Virtualized Environment

机译:虚拟环境资源配置数据密集型工作负载的自适应性能建模

获取原文
获取原文并翻译 | 示例
           

摘要

The processing of data-intensive workloads is a challenging and time-consuming task that often requires massive infrastructure to ensure fast data analysis. The cloud platform is the most popular and powerful scale-out infrastructure to perform big data analytics and eliminate the need to maintain expensive and high-end computing resources at the user side. The performance and the cost of such infrastructure depend on the overall server configuration, such as processor, memory, network, and storage configurations. In addition to the cost of owning or maintaining the hardware, the heterogeneity in the server configuration further expands the selection space, leading to non-convergence. The challenge is further exacerbated by the dependency of the application's performance on the underlying hardware. Despite an increasing interest in resource provisioning, few works have been done to develop accurate and practical models to proactively predict the performance of data-intensive applications corresponding to the server configuration and provision a cost-optimal configuration online.In this work, through a comprehensive real-system empirical analysis of performance, we address these challenges by introducing ProMLB: a proactive machine-learning-based methodology for resource provisioning. We first characterize diverse types of data-intensive workloads across different types of server architectures. The characterization aids in accurately capture applications' behavior and train a model for prediction of their performance.Then, ProMLB builds a set of cross-platform performance models for each application. Based on the developed predictive model, ProMLB uses an optimization technique to distinguish close-to-optimal configuration to minimize the product of execution time and cost. Compared to the oracle scheduler, ProMLB achieves 91% accuracy in terms of application-resource matching. On average, ProMLB improves the performance and resource utilization by 42.6% and 41.1%, respectively, compared to baseline scheduler. Moreover, ProMLB improves the performance per cost by 2.5× on average.
机译:数据密集型工作负载的处理是一个具有挑战性和耗时的任务,通常需要大量基础设施来确保快速数据分析。云平台是最流行和最强大的扩展基础设施,以执行大数据分析,并消除在用户侧维持昂贵和高端计算资源的需要。此类基础架构的性能和成本取决于整体服务器配置,例如处理器,内存,网络和存储配置。除了拥有或维护硬件的成本之外,服务器配置中的异构性还扩展了选择空间,导致非收敛。通过应用程序在底层硬件上的性能的依赖性进一步加剧了挑战。尽管对资源配置的兴趣日益越来越少,但是已经采取了很少的作品来开发准确和实用的模型,以主动预测与服务器配置相对应的数据密集型应用程序的性能,并通过全面的工作提供成本最佳配置。实际系统的性能分析,我们通过引入PROMLB来解决这些挑战:基于主动的机器学习的资源供应方法。我们首先在不同类型的服务器架构上表征各种类型的数据密集型工作负载。特性辅助辅助应用程序的行为和培训模型,以预测其性能。然后,PROMLB为每个应用程序构建一组跨平台性能模型。基于开发的预测模型,PROMLB使用优化技术来区分近距离的配置,以最小化执行时间和成本的乘积。与Oracle Scheduler相比,PROMLB在应用程序资源匹配方面取得了91%的准确性。平均而言,与基线调度程序相比,PROMLB分别将性能和资源利用提高了42.6%和41.1%。此外,PROMLB平均地提高了每次成本2.5倍的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号