首页> 外文期刊>Network, IEEE >Bigprovision: a provisioning framework for big data analytics
【24h】

Bigprovision: a provisioning framework for big data analytics

机译:Bigprovision:大数据分析的供应框架

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In the past few years, big data has attracted significant attention, and many analytics platforms, such as Hadoop, have been developed to enable the analysis of massive data. Nevertheless, it is still very challenging to provision, let alone optimize, a comprehensive system that includes various aspects, from the computing infrastructure to the analytics programs. To tackle this challenge, in this article, we propose a novel provisioning framework, BigProvision, to provision big data analytics systems. The main idea of the framework is to first evaluate and model the performance of different big data analytics approaches, given a set of sample data and various analytics requirements, such as the expected results, budget, response time, and so on. Based on the evaluation and modeling results, BigProvision can generate a provisioning configuration that can be used to configure the whole system for big data analytics. To evaluate the performance of the proposed framework, we develop an experimental prototype that supports three analytics platforms, Hadoop, Spark, and GraphLab. Our experiments show that for the classic PageRank analysis, both GraphLab and Spark can outperform Hadoop under different requirements. Moreover, by modeling the results, our prototype can determine the expected settings, such as the number of machines and network capacity, for the system that shall handle the complete data set. The prototype and experiments demonstrate that the proposed framework has great potential to facilitate the provision and optimization of future big data analytics systems.
机译:在过去的几年中,大数据已引起广泛关注,并且开发了许多分析平台(例如Hadoop)来分析海量数据。然而,提供,更不用说优化一个包括从计算基础架构到分析程序的各个方面的综合系统仍然是非常具有挑战性的。为了应对这一挑战,在本文中,我们提出了一个新颖的供应框架BigProvision,以供应大数据分析系统。该框架的主要思想是在给定一组样本数据和各种分析要求(例如预期结果,预算,响应时间等)的情况下,首先评估和建模不同大数据分析方法的性能。根据评估和建模结果,BigProvision可以生成预配配置,该配置可用于配置整个系统以进行大数据分析。为了评估所提出框架的性能,我们开发了一个实验原型,该原型支持三个分析平台:Hadoop,Spark和GraphLab。我们的实验表明,对于经典的PageRank分析,在不同要求下GraphLab和Spark都可以胜过Hadoop。此外,通过对结果进行建模,我们的原型可以确定应处理完整数据集的系统的预期设置,例如机器数量和网络容量。原型和实验表明,提出的框架具有极大的潜力,可以促进未来大数据分析系统的提供和优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号