【24h】

V-Hadoop: Virtualized Hadoop using containers

机译:V-Hadoop:使用容器的虚拟化Hadoop

获取原文

摘要

MapReduce is a popular programming model used to process large amounts of data by exploiting parallelism. Open-source implementations of MapReduce such as Hadoop are generally best suited for large, homogeneous clusters of commodity machines. However, many businesses cannot afford to invest in such infrastructure and others are reluctant to use cloud services due to data security and privacy concerns. In this paper, we present V-Hadoop, a framework that leverages Linux containers to allow users to run Hadoop jobs efficiently without requiring large, expensive, physical machine clusters. We describe our design and implementation of V-Hadoop and show that it can effectively support cluster-level parallelism. We experimentally demonstrate that V-Hadoop is a viable solution that performs competitively compared to solutions designed for large clusters.
机译:MapReduce是一种流行的编程模型,用于通过利用并行性来处理大量数据。 MapReduce的开源实现(例如Hadoop)通常最适合大型,同类的商用机器集群。但是,由于数据安全性和隐私问题,许多企业无力投资于此类基础架构,而另一些企业则不愿使用云服务。在本文中,我们介绍了V-Hadoop,这是一个利用Linux容器的框架,允许用户有效地运行Hadoop作业,而无需大型,昂贵的物理机器集群。我们描述了V-Hadoop的设计和实现,并表明它可以有效地支持集群级并行性。我们通过实验证明,与为大型集群设计的解决方案相比,V-Hadoop是一种具有竞争优势的可行解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号