Journal: Computer and Information Science

Hadoop Based Data Intensive Computation on IaaS Cloud Platforms


Abstract

Cloud computing is a relatively new form of computing that uses virtualized, dynamically scalable resources and is often provided as a pay-per-use service over the Internet, an intranet, or both. With increasing demand for data storage in the cloud, the study of data-intensive applications is becoming a primary focus. Data-intensive applications are those that involve high CPU usage while processing large volumes of data, typically hundreds of gigabytes, terabytes, or petabytes in size. This study was conducted on Amazon Elastic Compute Cloud (EC2) and Amazon Elastic MapReduce (EMR) using the HiBench Hadoop benchmark suite, which was used to run and evaluate Hadoop-based data-intensive computation on both cloud platforms. Both quantitative and qualitative comparisons of Amazon EC2 and Amazon EMR were performed, including a study of their pricing models, and measures are suggested for future studies and research.
