【24h】

TPCx-HS on the Cloud!

机译:TPCX-HS在云上!

获取原文

摘要

The introduction of web scale operations needed for social media coupled with ease of access to the internet by mobile devices has exponentially increased the amount of data being generated every day. By conservative estimates the world generates close to 50,000 GB of data every second, 90% of which is unstructured, and this growth is accelerating. From its origins as a web log processing system at Yahoo, the open source nature and efficient processing of Apache Hadoop has made it the industry standard for Big Data processing. TPCx-HS was the first benchmark standard by a major Industry-Standard performance consortium for the Big Data space. TPCx-HS is a derivative of Apache Hadoop Workloads; Teragen, Terasort and Teravalidate. Ever since its release by the TPC in August 2014, all the 18 results published (as of August 2016) have been based on on-premise, Bare-metal hardware configurations. This paper will show how Hadoop can be deployed on an OpenStack cloud using the OpenStack Sahara project and how TPCx-HS can be used to measure and evaluate the performance of the Cloud under Test (CuT). It will also show how an OpenStack cloud can be optimized to get the performance of TPCx-HS on the Cloud to match as closely as possible that on a Bare-metal configuration. Lastly, it will share results and experiences based on a Hadoop on Cloud Proof-of-Concept (POC), a study that was undertaken by the Dell Open Source Solutions team.
机译:引进需要通过移动设备再加上轻松访问互联网的社交媒体网站的业务规模成倍增加,每天产生的数据量。据保守估计全世界产生将近50000 GB的数据每一秒,其中90%是非结构化的,而这种增长正在加速。从它的起源是在雅虎网络日志处理系统,开源特性和Apache的Hadoop的有效处理,使之在大数据处理的行业标准。 TPCx-HS是通过对大数据领域的主要行业标准性能财团第一基准标准。 TPCx-HS就是Apache Hadoop的工作负荷的衍生物; Teragen,Terasort和Teravalidate。自2014年8月发布由TPC以来,全部18个结果公布(2016八月)都是基于内部部署,裸机硬件配置。本文将展示如何Hadoop的可在一个OpenStack的云使用OpenStack的撒哈拉项目进行部署和TPCx-HS可以如何被用来衡量和评估云下的测试(CUT)的性能。它也将显示一个OpenStack的云如何进行优化,以获得TPCx-HS的云上的性能,以尽可能地匹配于裸机配置。最后,将分享基于Hadoop的云验证的概念验证(POC)成果和经验,这是由戴尔的开源解决方案团队进行的一项研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号