SparkBench - A Spark Performance Testing Suite

Abstract

Spark has emerged as an easy-to-use, scalable, robust, and fast system for analytics, with a rapidly growing and vibrant community of users and contributors. It is multipurpose, with extensive and modular infrastructure for machine learning, graph processing, SQL, streaming, statistical processing, and more. Its rapid adoption therefore calls for a performance assessment suite that supports agile development, measurement, validation, optimization, configuration, and deployment decisions across a broad range of platform environments and test cases. Recognizing the need for such comprehensive and agile testing, this paper proposes going beyond existing performance tests for Spark and creating an expanded Spark performance testing suite. The proposal describes several desirable properties that follow from the larger scale, the greater and evolving variety, and the nuanced requirements of different applications of Spark. The paper identifies the major areas of performance characterization and the key methodological aspects that should be factored into the design of the proposed suite. The objective is to capture insights from industry and academia on how best to characterize the capabilities of Spark-based analytic platforms and to provide cost-effective, timely assessment of optimization opportunities.