Efficient File Accessing Techniques on Hadoop Distributed File Systems

Wei Qu [1]; Siyao Cheng [1]; Hongzhi Wang [1]

Abstract

The Hadoop framework emerged when traditional tools could no longer cope with big data. The Hadoop Distributed File System (HDFS), Hadoop's highly fault-tolerant distributed file system, effectively improves data access throughput and is well suited to applications that handle large datasets. However, when a large number of small files is processed, NameNode memory usage becomes so high that it limits the whole system. In this paper, we propose an approach to optimize HDFS performance on small files. The basic idea is to merge small files into a larger file whose size fits a block. In addition, indexes are built so that every file in HDFS can still be accessed quickly. Preliminary experimental results show that our approach achieves better performance.
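The abstract does not describe the merge procedure or the index layout, so the following is only a minimal sketch of the general idea, assuming a simple first-fit packing and an index that maps each file name to a container, offset, and length. It works on in-memory byte strings rather than on HDFS, and the names `IndexEntry`, `merge_small_files`, `read_file`, and `block_size` are hypothetical, not taken from the paper.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class IndexEntry:
    """Location of one small file inside a merged container (assumed layout)."""
    container_id: int  # which merged file holds the small file
    offset: int        # byte offset of the small file inside that container
    length: int        # size of the small file in bytes


def merge_small_files(
    files: Iterable[Tuple[str, bytes]], block_size: int
) -> Tuple[List[bytes], Dict[str, IndexEntry]]:
    """Pack (name, content) pairs into containers of at most block_size bytes."""
    containers: List[bytearray] = [bytearray()]
    index: Dict[str, IndexEntry] = {}
    for name, data in files:
        if len(data) > block_size:
            raise ValueError(f"{name} is larger than one block")
        # Start a new container when the current one cannot hold this file.
        if len(containers[-1]) + len(data) > block_size:
            containers.append(bytearray())
        current = containers[-1]
        index[name] = IndexEntry(len(containers) - 1, len(current), len(data))
        current.extend(data)
    return [bytes(c) for c in containers], index


def read_file(
    containers: List[bytes], index: Dict[str, IndexEntry], name: str
) -> bytes:
    """Fetch one small file with a single index lookup and one slice."""
    entry = index[name]
    container = containers[entry.container_id]
    return container[entry.offset : entry.offset + entry.length]


if __name__ == "__main__":
    # Tiny block size so the packing behaviour is visible.
    small = [("a.txt", b"aaaa"), ("b.txt", b"bbb"), ("c.txt", b"cccc")]
    merged, idx = merge_small_files(small, block_size=8)
    print(len(merged), idx["b.txt"], read_file(merged, idx, "c.txt"))
```

In this sketch the number of stored objects drops from one per small file to one per container, which is the effect the abstract relies on to relieve NameNode memory, while the index keeps individual files addressable. A real deployment would pass the configured HDFS block size as `block_size` and persist the index alongside the merged files.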

Bibliographic details

Source
《国际计算机前沿大会会议论文集》, 2016, Issue 1, pp. 88-90 (3 pages)
Authors
Wei Qu [1]; Siyao Cheng [1]; Hongzhi Wang [1]
Author affiliation

[1] School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China

Original format: PDF
Text language: CHI
Chinese Library Classification: Social science series, collected works, and serial publications
Keywords
HDFS; Hadoop; Index; Small files
Date indexed: 2023-07-26 01:31:35
