首页> 外文会议>International Conference on Information Technology >Addressing Name Node Scalability Issue in Hadoop Distributed File System Using Cache Approach
【24h】

Addressing Name Node Scalability Issue in Hadoop Distributed File System Using Cache Approach

机译:使用缓存方法解决Hadoop分布式文件系统中的名称节点可伸缩性问题

获取原文

摘要

Hadoop is a distributed batch processing infrastructure which is currently being used for big data management. The foundation of Hadoop consists of Hadoop Distributed File System (HDFS). HDFS presents a client-server architecture comprised of a Name Node and many Data Nodes. The Name Node stores the metadata for the Data Nodes and Data Node stores application data. The Name Node holds file system metadata in memory, and thus the limit to the number of files in a file system is governed by the amount of memory on the Name Node. Thus when the memory on Name Node is full there is no further chance of increasing the cluster capacity. In this paper we have used the concept of cache memory for handling the issue of Name Node scalability. The focus of this paper is to highlight our approach that tries to enhance the current architecture and ensure that Name Node does not reach its threshold value soon.
机译:Hadoop是一种分布式批处理基础架构,目前正用于大数据管理。 Hadoop的基础包括Hadoop分布式文件系统(HDFS)。 HDFS提出了一种由名称节点和许多数据节点组成的客户端-服务器体系结构。名称节点存储数据节点的元数据,数据节点存储应用程序数据。名称节点将文件系统元数据保存在内存中,因此文件系统中文件数量的限制由名称节点上的内存量决定。因此,当“名称节点”上的内存已满时,就不再有增加群集容量的机会。在本文中,我们使用了高速缓存的概念来处理名称节点可伸缩性问题。本文的重点是强调我们的方法,该方法试图增强当前的体系结构并确保名称节点不会很快达到其阈值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号