首页> 外文会议>IEEE International Conference on Big Data >Plug and play bench: Simplifying big data benchmarking using containers
【24h】

Plug and play bench: Simplifying big data benchmarking using containers

机译:即插即用的平台:使用容器简化大数据基准测试

获取原文

摘要

The recent boom of big data, coupled with the challenges of its processing and storage gave rise to the development of distributed data processing and storage paradigms like MapReduce, Spark, and NoSQL databases. With the advent of cloud computing, processing and storing such massive datasets on clusters of machines is now feasible with ease. However, there are limited tools and approaches, which users can rely on to gauge and comprehend the performance of their big data applications deployed locally on clusters, or in the cloud. Researchers have started exploring this area by providing benchmarking suites suitable for big data applications. However, many of these tools are fragmented, complex to deploy and manage, and do not provide transparency with respect to the monetary cost of benchmarking an application. In this paper, we present Plug And Play Bench (PAPB1): an infrastructure aware abstraction built to integrate and simplify the deployment of big data benchmarking tools on clusters of machines. PAPB automates the tedious process of installing, configuring and executing common big data benchmark workloads by containerising the tools and settings based on the underlying cluster deployment framework. Our proof of concept implementation utilises HiBench as the benchmark suite, HDP as the cluster deployment framework and Azure as the cloud platform. The paper further illustrates the inclusion of cost metrics based on the underlying Microsoft Azure cloud platform.
机译:最近大数据的繁荣,再加上其处理和存储的挑战,导致了诸如MapReduce,Spark和NoSQL数据库之类的分布式数据处理和存储范例的发展。随着云计算的出现,现在可以轻松地在机器集群上处理和存储如此庞大的数据集。但是,工具和方法有限,用户可以依靠这些工具和方法来评估和理解本地部署在集群或云中的大数据应用程序的性能。研究人员已开始通过提供适用于大数据应用程序的基准测试套件来探索这一领域。但是,这些工具中有许多是分散的,部署和管理起来很复杂,并且在基准化应用程序的金钱成本方面没有提供透明性。在本文中,我们介绍了即插即用的基准(PAPB 1 ):一种基础架构感知的抽象,旨在集成和简化在机器集群上大数据基准测试工具的部署。通过基于基础集群部署框架的工具和设置容器化,PAPB可以自动完成繁琐的安装,配置和执行通用大数据基准测试工作负载的过程。我们的概念验证实施使用HiBench作为基准套件,使用HDP作为群集部署框架,并使用Azure作为云平台。本文进一步说明了基于基础Microsoft Azure云平台的成本指标的包含。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号