首页> 外文期刊>Cloud Computing, IEEE Transactions on >Faster MapReduce Computation on Clouds Through Better Performance Estimation
【24h】

Faster MapReduce Computation on Clouds Through Better Performance Estimation

机译:通过更好的性能估算,在云上更快地执行MapReduce计算

获取原文
获取原文并翻译 | 示例
           

摘要

Processing Big Data in cloud is on the increase. An important issue for efficient execution of Big Data processing jobs on a cloud platform is selecting the best fitting virtual machine (VM) configuration(s) among the miscellany of choices that cloud providers offer. Wise selection of VM configurations can lead to better performance, cost and energy consumption. Therefore, it is crucial to explore the available configurations and opt for the best ones that well suit each MapReduce application. Profiling the given application on all the configurations is costly, time and energy consuming. An alternative is to run the application on a subset of configurations (sample configurations) and estimate its performance on other configurations based on the obtained values by sample configurations. We show that the choice of these sample configurations highly affects accuracy of later estimations. Our Smart Configuration Selection (SCS) scheme chooses better representatives from among all configurations by once-off analysis of given performance figures of the benchmarks so as to increase the accuracy of estimations of missing values, and consequently, to more accurately choose the configuration providing the highest performance. The results show that the SCS choice of sample configurations is very close to the best choice, and can reduce estimation error to 11.58 percent from the original 19.72 percent of random configuration selection. More importantly, using SCS estimations in a makespan minimization algorithm improves the execution time by up to 36.03 percent compared with random sample selection.
机译:在云中处理大数据的趋势正在增加。在云平台上有效执行大数据处理作业的一个重要问题是,在云提供商提供的各种选择中,选择最合适的虚拟机(VM)配置。明智地选择VM配置可以带来更好的性能,成本和能耗。因此,至关重要的是探索可用的配置并选择最适合每个MapReduce应用程序的最佳配置。在所有配置上对给定的应用程序进行性能分析非常昂贵,耗时且耗能。一种替代方法是在配置的子集(样本配置)上运行该应用程序,并基于样本配置所获得的值来估计其在其他配置上的性能。我们表明,这些样本配置的选择会极大地影响以后估算的准确性。我们的智能配置选择(SCS)方案通过对基准的给定性能数据进行一次分析,从所有配置中选择更好的代表,从而提高了缺失值估算的准确性,因此,可以更准确地选择提供以下配置的配置:最高的性能。结果表明,样本配置的SCS选择非常接近最佳选择,并且可以将估计误差从随机配置选择的原始19.72%降低到11.58%。更重要的是,与随机样本选择相比,在制造期最小化算法中使用SCS估计可将执行时间提高多达36.03%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号