首页> 外文会议>International Conference on Information Technology >Addressing Name Node Scalability Issue in Hadoop Distributed File System Using Cache Approach
【24h】

Addressing Name Node Scalability Issue in Hadoop Distributed File System Using Cache Approach

机译:使用缓存方法解决Hadoop分布式文件系统中的名称节点可伸缩性问题

获取原文

摘要

Hadoop is a distributed batch processing infrastructure which is currently being used for big data management. The foundation of Hadoop consists of Hadoop Distributed File System (HDFS). HDFS presents a client-server architecture comprised of a Name Node and many Data Nodes. The Name Node stores the metadata for the Data Nodes and Data Node stores application data. The Name Node holds file system metadata in memory, and thus the limit to the number of files in a file system is governed by the amount of memory on the Name Node. Thus when the memory on Name Node is full there is no further chance of increasing the cluster capacity. In this paper we have used the concept of cache memory for handling the issue of Name Node scalability. The focus of this paper is to highlight our approach that tries to enhance the current architecture and ensure that Name Node does not reach its threshold value soon.
机译:Hadoop是一种分布式批处理基础架构,目前正在用于大数据管理。 Hadoop的基础包括Hadoop分布式文件系统(HDFS)。 HDFS呈现由名称节点和许多数据节点组成的客户端 - 服务器架构。 名称节点存储数据节点和数据节点的元数据存储应用程序数据。 名称节点在内存中保存文件系统元数据,因此文件系统中文件数的限制由名称节点上的内存量管理。 因此,当名称节点上的内存已满时,没有进一步增加集群容量的可能性。 在本文中,我们使用了缓存内存的概念来处理名称节点可伸缩性问题。 本文的焦点是突出显示我们试图增强当前架构的方法,并确保姓名节点不快达到其阈值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号