Addressing Name Node Scalability Issue in Hadoop Distributed File System Using Cache Approach

Abstract

Hadoop is a distributed batch-processing infrastructure that is currently used for big data management. Its foundation is the Hadoop Distributed File System (HDFS). HDFS has a client-server architecture consisting of a Name Node and many Data Nodes. The Name Node stores the metadata for the Data Nodes, and the Data Nodes store application data. Because the Name Node holds file system metadata in memory, the number of files in a file system is limited by the amount of memory on the Name Node. Once the memory on the Name Node is full, the cluster capacity cannot be increased any further. In this paper we use the concept of cache memory to handle the Name Node scalability issue. The paper presents our approach, which aims to enhance the current architecture and ensure that the Name Node does not reach its threshold value as quickly.
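
The memory limit described in the abstract can be illustrated with a back-of-the-envelope calculation. The sketch below is not taken from the paper: the function name `max_files`, the figure of roughly 150 bytes per namespace object (a commonly quoted rule of thumb, not a measured value), and the 64 GiB heap are all assumptions chosen for illustration. Directories and other overheads are ignored.

```python
# Rough estimate of how many files a Name Node heap can track.
# All constants are illustrative assumptions, not values from the paper.
BYTES_PER_OBJECT = 150  # assumed rule-of-thumb cost of one file or block object


def max_files(heap_bytes: int, blocks_per_file: int = 1) -> int:
    """Estimate the number of files whose metadata fits in heap_bytes.

    Each file is counted as one file object plus blocks_per_file block
    objects; directories and JVM overhead are ignored.
    """
    objects_per_file = 1 + blocks_per_file
    return heap_bytes // (objects_per_file * BYTES_PER_OBJECT)


if __name__ == "__main__":
    heap = 64 * 1024**3  # hypothetical 64 GiB Name Node heap
    print("single-block files:", max_files(heap))
    print("four-block files:  ", max_files(heap, blocks_per_file=4))
```

Under these assumptions the estimate depends only on the Name Node heap and the per-file object count, not on the number of Data Nodes, which is why adding Data Nodes alone does not lift the limit.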

Bibliographic record

Source
International Conference on Information Technology | 2014 | pp. 321-326 | 6 pages
Authors
Mukhopadhyay Debajyoti; Agrawal Chetan; Maru Devesh; Yedale Pooja; Gadekar Pranav;
Original format: PDF
Keywords
cache storage; client-server systems; data handling; distributed databases; meta data; parallel processing; Big Data management; DataNodes; HDFS; Hadoop distributed file system; NameNode scalability; application data storage; cache approach; cache memory concept; client-server architecture; cluster capacity; distributed batch processing infrastructure; file system meta data; metadata storage; threshold value; Big data; Distributed databases; File systems; Information technology; Memory management; Random access memory; Cache; Data Node; HDFS; Hadoop; Name Node;

Similar literature

1. Comparative Study on Hadoop Distributed File System Based on Security Issues [J]. Hadeer Mahmoud, Abdelfatah Hegazy, Mohamed H. Khafagy. Asian Journal of Information Technology. 2017, No. 6
2. A write-friendly approach to manage namespace of Hadoop distributed file system by utilizing nonvolatile memory [J]. Choi Won Gi, Park Sanghyun. Journal of supercomputing. 2019, No. 10
3. An overall approach to achieve load balancing for Hadoop Distributed File System [J]. Lin Chi-Yi, Lin Ying-Chen. International journal of web and grid services. 2017, No. 4
4. Addressing Name Node Scalability Issue in Hadoop Distributed File System Using Cache Approach [C]. Mukhopadhyay Debajyoti, Agrawal Chetan, Maru Devesh, International Conference on Information Technology. 2014
5. Distributed systems in small scale research environments: Hadoop and the EM algorithm [D]. Remington, Jason. 2011
6. Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework [O]. Steven Lewis, Attila Csordas, Sarah Killcoyne, 2012
7. Addressing NameNode Scalability Issue in Hadoop Distributed File System using Cache Approach [O]. Mukhopadhyay, Debajyoti, Agrawal, Chetan, Maru, Devesh, 2014
