首页> 外文会议>IEEE International Conference on Cloud Computing >Towards Automatic Tuning of Apache Spark Configuration
【24h】

Towards Automatic Tuning of Apache Spark Configuration

机译:致力于自动调整Apache Spark配置

获取原文

摘要

Apache Spark provides a large number of configuration settings that may be tuned to improve the performance of specific applications running on the platform. However, it is non-trivial to identify the combination of settings that may improve the performance of a specific application as the influence of each setting on performance may vary across applications. As identifying the optimal combination of settings is computationally infeasible due to exponential search space, in this paper we investigate machine learning based approaches to construct application specific performance influence models, and use them to tune the performance of specific applications running on Apache Spark platform. We evaluated our approach using 9 different applications on a 6 node cluster and demonstrated that our framework can reduce execution time by 22.8% to 40.0% depending on applications.
机译:Apache Spark提供了许多配置设置,可以对其进行调整以提高平台上运行的特定应用程序的性能。但是,识别可以改善特定应用程序性能的设置组合并非易事,因为每个设置对性能的影响可能会因应用程序而异。由于由于指数搜索空间而无法确定设置的最佳组合在计算上是不可行的,因此在本文中,我们研究了基于机器学习的方法来构建应用程序特定的性能影响模型,并使用它们来调整在Apache Spark平台上运行的特定应用程序的性能。我们在6个节点的群集上使用9个不同的应用程序评估了我们的方法,并证明了我们的框架可以根据应用程序将执行时间减少22.8%至40.0%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号