首页> 外文会议>Data Engineering Workshops (ICDEW), 2010 >The HiBench benchmark suite: Characterization of the MapReduce-based data analysis
【24h】

The HiBench benchmark suite: Characterization of the MapReduce-based data analysis

机译:HiBench基准套件:基于MapReduce的数据分析的特征

获取原文

摘要

The MapReduce model is becoming prominent for the large-scale data analysis in the cloud. In this paper, we present the benchmarking, evaluation and characterization of Hadoop, an open-source implementation of MapReduce. We first introduce HiBench, a new benchmark suite for Hadoop. It consists of a set of Hadoop programs, including both synthetic micro-benchmarks and real-world Hadoop applications. We then evaluate and characterize the Hadoop framework using HiBench, in terms of speed (i.e., job running time), throughput (i.e., the number of tasks completed per minute), HDFS bandwidth, system resource (e.g., CPU, memory and I/O) utilizations, and data access patterns.
机译:MapReduce模型对于云中的大规模数据分析正变得越来越重要。在本文中,我们介绍了Hadoop的基准测试,评估和特性,Hadoop是MapReduce的开源实现。我们首先介绍HiBench,这是Hadoop的新基准套件。它由一组Hadoop程序组成,包括合成的微基准测试和实际的Hadoop应用程序。然后,我们使用HiBench在速度(即作业运行时间),吞吐量(即每分钟完成的任务数),HDFS带宽,系统资源(例如CPU,内存和I /)方面评估和表征Hadoop框架。 O)利用率和数据访问模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号