首页> 外文期刊>IEICE transactions on information and systems >A Distributed and Cooperative NameNode Cluster for a Highly-Available Hadoop Distributed File System
【24h】

A Distributed and Cooperative NameNode Cluster for a Highly-Available Hadoop Distributed File System

机译:适用于高可用性Hadoop分布式文件系统的分布式协作NameNode集群

获取原文
           

摘要

Recently, Hadoop has attracted much attention from engineers and researchers as an emerging and effective framework for Big Data . HDFS (Hadoop Distributed File System) can manage a huge amount of data with high performance and reliability using only commodity hardware. However, HDFS requires a single master node, called a NameNode , to manage the entire namespace (or all the i-nodes) of a file system. This causes the SPOF (Single Point Of Failure) problem because the file system becomes inaccessible when the NameNode fails. This also causes a bottleneck of efficiency since all the access requests to the file system have to contact the NameNode . Hadoop 2.0 resolves the SPOF problem by introducing manual failover based on two NameNodes , Active and Standby . However, it still has the efficiency bottleneck problem since all the access requests have to contact the Active in ordinary executions. It may also lose the advantage of using commodity hardware since the two NameNodes have to share a highly reliable sophisticated storage. In this paper, we propose a new HDFS architecture to resolve all the problems mentioned above.
机译:最近,Hadoop作为 Big Data的新兴且有效的框架,引起了工程师和研究人员的广泛关注。 HDFS(Hadoop分布式文件系统)仅使用商用硬件即可以高性能和可靠性管理大量数据。但是,HDFS仅需要一个称为 NameNode的主节点来管理文件系统的整个名称空间(或所有i节点)。这会导致 SPOF(单点故障)问题,因为当 NameNode发生故障时,文件系统将变得不可访问。由于所有对文件系统的访问请求都必须联系 NameNode,因此这也会导致效率方面的瓶颈。 Hadoop 2.0通过引入基于两个 NameNodes Active和 Standby的手动故障转移,解决了SPOF问题。但是,它仍然存在效率瓶颈问题,因为在常规执行中,所有访问请求都必须联系 Active。由于两个 NameNode必须共享高度可靠的复杂存储,因此它也可能失去使用商品硬件的优势。在本文中,我们提出了一种新的HDFS架构来解决上述所有问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号