首页> 外文会议>International Conference on Network-Based Information Systems >A Load-Balancing Algorithm for Hadoop Distributed File System
【24h】

A Load-Balancing Algorithm for Hadoop Distributed File System

机译:Hadoop分布式文件系统的负载均衡算法

获取原文

摘要

Hadoop Distributed File System (HDFS) is developed to store a huge volume of data. Files are divided into blocks and the replicated blocks are then stored on many DataNodes in a distributed manner. Although doing so makes HDFS fault tolerant, the random nature of the default block placement strategy may lead to load imbalance among the DataNodes. Moreover, the built-in load-balancing algorithm Balancer may reduce the performance and consume lots of network resources. Therefore in this paper we consider all the situations that may influence the load-balancing state and propose a new load-balancing algorithm. In the proposed algorithm a new role named BalanceNode is introduced to help in matching heavy-loaded and light-loaded DataNodes, so those light-loaded nodes can share part of the load from heavy-loaded ones. The simulation results show that our algorithm can achieve a good load-balancing state in the HDFS compared with two existing algorithms.
机译:Hadoop分布式文件系统(HDFS)旨在存储大量数据。将文件分为多个块,然后将复制的块以分布式方式存储在许多DataNode上。尽管这样做可以使HDFS容错,但是默认块放置策略的随机性质可能会导致DataNode之间的负载不平衡。此外,内置的负载平衡算法Balancer可能会降低性能并消耗大量网络资源。因此,在本文中,我们考虑了可能影响负载均衡状态的所有情况,并提出了一种新的负载均衡算法。在提出的算法中,引入了一个名为BalanceNode的新角色,以帮助匹配重负载和轻负载的DataNode,以便那些轻负载节点可以分担重负载节点的部分负载。仿真结果表明,与现有的两种算法相比,我们的算法可以在HDFS中达到良好的负载均衡状态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号