首页> 外文会议>IEEE International Conference on Cluster Computing >Zput: A speedy data uploading approach for the Hadoop Distributed File System
【24h】

Zput: A speedy data uploading approach for the Hadoop Distributed File System

机译:Zput:Hadoop分布式文件系统的快速数据上传方法

获取原文
获取外文期刊封面目录资料

摘要

Hadoop Distributed File System (HDFS) is the storage component of the Hadoop framework, which is designed for maintaining and processing huge datasets efficiently among cluster nodes. To cooperate with MapReduce, the computation infrastructure of Hadoop, data is required to be uploaded from local file systems to HDFS. Unfortunately when data is of massive scale, the uploading procedure becomes extremely time-consuming, which causes serious delay for urgent tasks. This primary contribution of this paper is the proposition of Zput, a speedy data uploading mechanism which can significantly accelerate uploading by using metadata mapping approach. After the implementation is described and corresponding advantages are narrated, disadvantages are also analyzed and eliminated by using an approach named remote block placement. Evaluation results show this new mechanism can reduce the running time of uploading process by about 60-90%, and the remote block placement can boost the course of block distribution by about 30–40%, while maintaining the complete compatibility for upper-layer applications.
机译:Hadoop分布式文件系统(HDFS)是Hadoop框架的存储组件,旨在用于在群集节点之间高效地维护和处理大型数据集。为了与Hadoop的计算基础架构MapReduce合作,需要将数据从本地文件系统上载到HDFS。不幸的是,当数据规模巨大时,上载过程变得非常耗时,这严重地延迟了紧急任务。本文的主要贡献是Zput的主张,Zput是一种快速的数据上传机制,可以通过使用元数据映射方法显着加快上传速度。在描述了实现方式并叙述了相应的优点之后,还使用称为远程块放置的方法来分析和消除了缺点。评估结果表明,这种新机制可以将上载过程的运行时间减少大约60-90%,并且远程块放置可以将块分配的过程提高大约30-40%,同时保持与上层应用程序的完全兼容性。 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号