首页> 外文会议>IEEE International Conference on Big Data >ECL-watch: A big data application performance tuning tool in the HPCC systems platform
【24h】

ECL-watch: A big data application performance tuning tool in the HPCC systems platform

机译:ECL-Watch:HPCC系统平台中的大数据应用性能调整工具

获取原文

摘要

The proliferation of Big Data processing environments such as Hadoop, Apache Spark, and HPCC Systems is driving the development of performance analysis tools in these distributed systems. The goal is to achieve high performance through the optimization of Big Data applications. However, tuning performance in a fine-grained manner is quite challenging due to the high complexity and massive size of the distributed systems. ECL-Watch is a data-flow based fine-grained comprehensive Big Data performance analysis tool utilizing the high level declarative dataflow programming language ECL in HPCC Systems. As a case study, we implement and optimize the Yinyang K-Means machine learning algorithm in ECL in HPCC Systems. The experimental results show that the performance of the native ECL version of the Yinyang K-Means algorithm increased significantly after tuning: from being about three times slower than the standard K-Means implementation in ECL, to become roughly 15% faster than standard K-Means.
机译:大数据处理环境(如Hadoop,Apache Spark和HPCC系统)的扩散正在推动这些分布式系统中的性能分析工具的开发。目标是通过优化大数据应用来实现高性能。然而,由于分布式系统的高度复杂性和大规模大小,以细粒化的方式调整性能非常具有挑战性。 ECL-WATIO是一种基于数据流的细粒度综合大数据性能分析工具,利用HPCC系统中的高级声明性数据流程编程语言ECL。作为一个案例研究,我们在HPCC系统中实施和优化ECL中的YINYANG K-MEARE机器学习算法。实验结果表明,在调整后,幂阳K型算法的天然ECL版本的性能显着增加:从ECL中的标准K均值慢三倍增加了大约3倍,比标准K为大约15 % -方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号