首页> 外文会议>International Conference on Distributed Computing Systems Workshops >A Reliability Benchmark for Big Data Systems on JointCloud
【24h】

A Reliability Benchmark for Big Data Systems on JointCloud

机译:合联网大数据系统的可靠性基准

获取原文

摘要

JointCloud provides a large-scale, flexible, and elastic computing resource platform. Big data systems such as MapReduce and Spark are widely deployed on this platform for big data processing. How to choose a cloud platform in accordance with the need of customers is a problem. Current performance benchmarking suites can choose suitable cloud platforms for customers. However, they do not consider the reliability of applications running atop big data systems. These systems have high scalability, but the applications running atop them often generate runtime errors, such as out of memory errors, I/O exceptions, and task timeouts. For users, they want to know whether the developed applications have potential application faults. For system designers and managers, they want to know whether the deployed/updated systems have potential system faults. In addition, current benchmarks for big data system are also only designed for performance testing. To fill this gap, we propose a reliability benchmark, which contains representative applications, an abnormal data generator, and a configuration combination generator. Different from performance benchmarks, this benchmark (1) generates abnormal test data according to the application characteristics, and (2) reduces the configuration combination space based on configuration features. Currently, we implemented this benchmark on Spark system. In our preliminary test, we found three types of errors (i.e., out of memory errors, timeout and wrong results) in five SQL, Machine Learning, and Graph applications.
机译:ConntCloud提供大规模,灵活和弹性的计算资源平台。大数据系统(如MapReduce和Spark)广泛部署在该平台上,以实现大数据处理。如何按照客户的需要选择云平台是一个问题。目前的性能基准套房可以为客户选择合适的云平台。但是,它们不考虑在大数据系统上运行的应用程序的可靠性。这些系统具有高可扩展性,但在其上运行的应用程序通常会生成运行时错误,例如退出内存错误,I / O异常和任务超时。对于用户来说,他们想知道开发的应用是否具有潜在的应用故障。对于系统设计者和管理者,他们想知道部署/更新的系统是否具有潜在的系统故障。此外,大数据系统的当前基准也仅用于性能测试。为了填补此差距,我们提出了一种可靠性基准,其中包含代表性应用程序,异常数据发生器和配置组合生成器。与性能基准不同,此基准(1)根据应用特征生成异常测试数据,(2)基于配置功能减少配置组合空间。目前,我们在Spark系统上实施了该基准。在我们的初步测试中,我们在五个SQL,机器学习和图形应用中找到了三种类型的错误(即,内存错误,超时和错误的结果)。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号