International Workshop on Intelligent Communication and Social Networks

NCluster: Using Multiple Active Namenodes to Achieve High Availability for HDFS

Abstract

Hadoop HDFS is an open source project from the Apache Software Foundation for scalable, distributed computing and data storage. HDFS has become a critical component of today's cloud computing environments, and a wide range of applications are built on top of it. However, the initial design of HDFS introduced a single point of failure: HDFS contains only one active namenode. If this namenode experiences a software or hardware failure, the whole HDFS cluster is unusable until the namenode has recovered. This is why people are reluctant to deploy HDFS for applications that require high availability. In this paper, we present a solution that enables high availability for the HDFS namenode through efficient metadata replication. Our solution has two major advantages over existing ones. First, we use multiple active namenodes, instead of one, to build a cluster that serves metadata requests simultaneously. Second, we implement a pub/sub system to handle metadata replication across these active namenodes efficiently. Based on this solution, we implement a prototype called NCluster and integrate it with HDFS. We also evaluate NCluster to demonstrate its feasibility and effectiveness. The experimental results show that our solution performs well, with low replication cost, good throughput, and good scalability.
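The abstract does not describe NCluster's internals, so the following is only a minimal sketch of the general idea it names: several active namenodes that each accept metadata requests and replicate the resulting edits to one another through a pub/sub channel. Every name here (MetadataEdit, Broker, Namenode, the "create" and "delete" operations) is hypothetical and chosen for illustration. The broker is a synchronous, in-process stand-in that assigns a global order to edits. Networking, persistence, and failure handling are deliberately left out.

```python
from dataclasses import dataclass
from typing import Callable, List, Set


@dataclass(frozen=True)
class MetadataEdit:
    """One namespace change. Hypothetical format, not NCluster's."""
    seq: int     # global order assigned by the broker
    origin: str  # id of the namenode that accepted the client request
    op: str      # "create" or "delete"
    path: str


class Broker:
    """Toy in-process pub/sub channel that fans edits out to all subscribers."""

    def __init__(self) -> None:
        self._subscribers: List[Callable[[MetadataEdit], None]] = []
        self._next_seq = 0

    def subscribe(self, callback: Callable[[MetadataEdit], None]) -> None:
        self._subscribers.append(callback)

    def publish(self, origin: str, op: str, path: str) -> MetadataEdit:
        # Stamp the edit with the next sequence number, then deliver it
        # to every subscriber, including the namenode that published it.
        edit = MetadataEdit(self._next_seq, origin, op, path)
        self._next_seq += 1
        for deliver in self._subscribers:
            deliver(edit)
        return edit


class Namenode:
    """An active namenode holding a replica of the namespace (a set of paths)."""

    def __init__(self, node_id: str, broker: Broker) -> None:
        self.node_id = node_id
        self.namespace: Set[str] = set()
        self.applied_seq = -1  # highest sequence number applied so far
        self._broker = broker
        broker.subscribe(self._on_edit)

    # Client-facing operations: publish the edit instead of mutating
    # local state directly, so all replicas apply edits in the same order.
    def create(self, path: str) -> None:
        self._broker.publish(self.node_id, "create", path)

    def delete(self, path: str) -> None:
        self._broker.publish(self.node_id, "delete", path)

    def exists(self, path: str) -> bool:
        return path in self.namespace

    def _on_edit(self, edit: MetadataEdit) -> None:
        if edit.seq <= self.applied_seq:
            return  # already applied; ignore duplicate delivery
        if edit.op == "create":
            self.namespace.add(edit.path)
        elif edit.op == "delete":
            self.namespace.discard(edit.path)
        self.applied_seq = edit.seq


if __name__ == "__main__":
    broker = Broker()
    nodes = [Namenode(f"nn{i}", broker) for i in range(3)]
    nodes[0].create("/data/a")  # any active namenode can accept a write
    nodes[1].create("/data/b")
    nodes[2].delete("/data/a")
    for node in nodes:
        print(node.node_id, sorted(node.namespace))
```

In this sketch, routing every write through the broker is what keeps the replicas identical: each namenode applies the same edits in the same order. A real system would also have to handle message loss, namenode crashes, and catch-up for a rejoining node, none of which is modeled here.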
