Journal: International Journal of Applied Engineering Research

Comparative Study of Hadoop over Containers and Hadoop over Virtual Machine


Abstract

Purpose - This paper elaborates the concept of containerization, which allows Hadoop infrastructure to run in containers so that clusters can be brought up quickly and accessed easily. Such clusters can be used to install, process, and monitor or analyse Big Data under various configurations.

Design/methodology/approach - This paper is a research-based comparative study of two ways of deploying the Hadoop framework: on a simple virtual machine (VM) and on container services from Docker. The analysis uses the TeraSort benchmark. Its TeraGen program generates random data on the HDFS cluster, and TeraSort then sorts that data. The results are analysed in terms of the time taken on the virtual machine and in the Docker container.

Findings - The benchmark of the cluster took considerably less time on the container than on the virtual machine, which the authors take as evidence that the cluster runs more efficiently on the container. The graphical analysis shows the difference in time taken by the two services: the Docker container implementation takes less time to process a task in the cluster.

Originality/value - The originality of this research lies in the evaluation and comparison of Docker containers with virtual machines for deploying the Hadoop framework to process data. The evaluation and time-based analysis were carried out solely by the authors using TeraSort. The results can help optimise data centres that run virtual machines to deploy their services.
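The benchmark procedure described in the abstract (generate data with TeraGen, sort it with TeraSort, compare elapsed times) could be scripted roughly as follows. This is a minimal sketch and not the authors' actual harness: the jar location `EXAMPLES_JAR`, the helper names, and the injectable `runner` argument are all assumptions made for illustration. The `teragen` and `terasort` program names and the 100-byte row size are those of the standard Hadoop MapReduce examples.

```python
import subprocess
import time
from typing import Callable, Dict, List

# Assumption: path to the Hadoop MapReduce examples jar; adjust per installation.
EXAMPLES_JAR = "hadoop-mapreduce-examples.jar"
ROW_BYTES = 100  # TeraGen writes fixed 100-byte rows


def teragen_cmd(rows: int, out_dir: str) -> List[str]:
    """Command that generates `rows` random rows into an HDFS directory."""
    return ["hadoop", "jar", EXAMPLES_JAR, "teragen", str(rows), out_dir]


def terasort_cmd(in_dir: str, out_dir: str) -> List[str]:
    """Command that sorts the TeraGen output into another HDFS directory."""
    return ["hadoop", "jar", EXAMPLES_JAR, "terasort", in_dir, out_dir]


def run_checked(cmd: List[str]) -> None:
    """Run a command and raise if it exits with a non-zero status."""
    subprocess.run(cmd, check=True)


def timed(cmd: List[str], runner: Callable[[List[str]], object] = run_checked) -> float:
    """Return the wall-clock seconds taken to run `cmd`."""
    start = time.perf_counter()
    runner(cmd)
    return time.perf_counter() - start


def benchmark(size_bytes: int, in_dir: str, out_dir: str,
              runner: Callable[[List[str]], object] = run_checked) -> Dict[str, float]:
    """Time TeraGen followed by TeraSort for a data set of about `size_bytes`."""
    rows = size_bytes // ROW_BYTES
    return {
        "teragen_s": timed(teragen_cmd(rows, in_dir), runner),
        "terasort_s": timed(terasort_cmd(in_dir, out_dir), runner),
    }


def speedup(vm_seconds: float, container_seconds: float) -> float:
    """Ratio of VM time to container time; above 1 means the container was faster."""
    return vm_seconds / container_seconds
```

Under these assumptions, the same `benchmark(...)` call would be run once against the cluster on the virtual machine and once against the cluster in Docker containers, with identical data sizes and fresh HDFS directories, and the two `terasort_s` values passed to `speedup`.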
