首页> 外文会议>International conference on algorithms and architectures for parallel processing >Hmfs: Efficient Support of Small Files Processing over HDFS
【24h】

Hmfs: Efficient Support of Small Files Processing over HDFS

机译:HMF:有效支持HDFS上的小文件处理

获取原文
获取外文期刊封面目录资料

摘要

The storage and access of massive small files are one of the challenges in the design of distributed file system. Hadoop distributed file system (HDFS) is primarily designed for reliable storage and fast access of very big files while it suffers a performance penalty with increasing number of small files. A middleware called Hmfs is proposed in this paper to improve the efficiency of storing and accessing small files on HDFS. It is made up of three layers, file operation interfaces to make it easier for software developers to submit different file requests, file management tasks to merge small files into big ones or extract small files from big ones in the background, and file buffers to improve the I/O performance. Hmfs boosts the file upload speed by using asynchronous write mechanism and the file download speed by adopting prefetching and caching strategy. The experimental results show that Hmfs can help to obtain high speed of storage and access for massive small files on HDFS.
机译:海量小文件的存储和访问是分布式文件系统设计中的挑战之一。 Hadoop分布式文件系统(HDFS)主要用于可靠存储和快速访问非常大的文件,而随着小文件数量的增加,它会遭受性能损失。本文提出了一种称为Hmfs的中间件,以提高HDFS上存储和访问小文件的效率。它由三层组成,文件操作界面使软件开发人员可以更轻松地提交不同的文件请求,文件管理任务可以将小文件合并为大文件,或在后台从大文件中提取小文件,文件缓冲区也可以进行改进I / O性能。 Hmfs通过使用异步写入机制来提高文件上传速度,并通过采用预取和缓存策略来提高文件下载速度。实验结果表明,Hmfs可以帮助实现HDFS上大量海量小文件的高速存储和访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号