...
首页> 外文期刊>Procedia Computer Science >An Optimal Solution of Storing and Processing Small Image Files on Hadoop
【24h】

An Optimal Solution of Storing and Processing Small Image Files on Hadoop

机译:在Hadoop上存储和处理小图像文件的最佳解决方案

获取原文
           

摘要

The rapid development of the Internet, especially mobile Internet, makes it much easier for people to make social contacts online. Nowadays they tend to spend more and more time on social network service, producing a lot of image files. This brings a challenge to traditional standalone framework on handing the continued increasing image files. Therefore, it is advisable to find a new way to face the challenge. Hadoop is a notable, widely-used project for distributed storage and computations with high efficiency, data integrity, reliability and fault tolerance. Hadoop Distributed File System and MapReduce are two primary subprojects respectively for big data storage and computations. However, Hadoop do not provide any interface for image processing. Worse, both Hadoop Distributed File System and MapReduce have trouble processing large amount of small files, decreasing efficiency of files access and distributed computations. This prevents us from performing images processing actions on Hadoop. This paper proposes a method to optimize small image files storage on Hadoop and self-defines an input/output format to enable Hadoop to process image files.
机译:Internet的快速发展,尤其是移动Internet,使人们更容易在线进行社交联系。如今,他们倾向于在社交网络服务上花费越来越多的时间,产生大量图像文件。这给传统的独立框架带来了不断增长的图像文件带来的挑战。因此,建议找到一种应对挑战的新方法。 Hadoop是一个著名的,广泛使用的项目,用于分布式存储和计算,具有高效,数据完整性,可靠性和容错能力。 Hadoop分布式文件系统和MapReduce是分别用于大数据存储和计算的两个主要子项目。但是,Hadoop不提供任何用于图像处理的接口。更糟糕的是,Hadoop分布式文件系统和MapReduce都无法处理大量的小文件,从而降低了文件访问和分布式计算的效率。这使我们无法在Hadoop上执行图像处理操作。本文提出了一种在Hadoop上优化小型图像文件存储的方法,并自定义输入/输出格式以使Hadoop能够处理图像文件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号