首页> 外文会议>International Conference on Systems and Informatics >Otterman: A Novel Approach of Spark Auto-tuning by a Hybrid Strategy
【24h】

Otterman: A Novel Approach of Spark Auto-tuning by a Hybrid Strategy

机译:Otterman:一种基于混合策略的火花自动调整的新方法

获取原文

摘要

Spark has become a very attractive platform for big data analytics in recent years due to its unique advantages such as parallelism, fault tolerance, and complexity associated with clusters setup. On the spark platform, users can adjust parameter configurations according to different job requirements and specific applications to optimize performance. This leads to a problem that we can't ignore, Spark already has more than 180 parameters, and its huge combination of parameters means that we can't rely on manual tuning to grasp the impact of all parameters on performance. In order to solve the problem of relying heavily on expert experience and manual operation, we propose Otterman, a parameters optimization approach based on the combination of Simulated Annealing algorithm and Least Squares method, which can help us dynamically adjust parameters according to job types to obtain optimal configuration to improve performance. Simulated Annealing can find the optimal solution, but has poor convergence. We make use of the Least Squares method to effectively improve the speed at which the former converges to the optimal solution. Otterman is simple and easy to perform, with no additional cost. The effectiveness of the approach is verified by experiments, the results show that Otterman's average performance has increased by 30% compared to the default parameters configuration, with an accuracy of about 68%.
机译:由于其独特的优势,例如并行性,容错性以及与群集设置相关的复杂性,Spark近年来已成为大数据分析的非常有吸引力的平台。在spark平台上,用户可以根据不同的工作要求和特定的应用程序调整参数配置,以优化性能。这导致了一个我们不能忽略的问题,Spark已经拥有超过180个参数,其庞大的参数组合意味着我们不能依靠手动调整来掌握所有参数对性能的影响。为了解决严重依赖专家经验和人工操作的问题,我们提出了Otterman,一种基于模拟退火算法和最小二乘方法相结合的参数优化方法,可以帮助我们根据工作类型动态调整参数以获得优化配置以提高性能。模拟退火可以找到最佳解决方案,但收敛性较差。我们使用最小二乘法来有效提高前者收敛到最优解的速度。 Otterman简单易行,无需支付额外费用。通过实验验证了该方法的有效性,结果表明,与默认参数配置相比,Otterman的平均性能提高了30%,准确度约为68%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号