首页> 外文会议>IEEE Smart World Congress >SHAstor: A Scalable HDFS-Based Storage Framework for Small-Write Efficiency in Pervasive Computing
【24h】

SHAstor: A Scalable HDFS-Based Storage Framework for Small-Write Efficiency in Pervasive Computing

机译:Shastor:用于普遍计算的小写效率的基于缩放的基于HDFS的存储框架

获取原文

摘要

It is well known that small files are often created and accessed in pervasive computing in which information is processed with limited resources via linking with objects as encountered. And the Hadoop framework, as a de facto big data processing platform though very popular in practice, cannot effectively process the small files. In this paper, we propose a scalable HDFS-based storage framework, named SHAstor, to improve the throughput in processing of small-writes for pervasive computing paradigm. Compared to the classic HDFS, the essence of this approach is to merge the incoming small writes into a large chunk of data, either at client side or at server side, and then store it as a big file in the framework. As a consequence, this could substantially reduce the number of small files to process the pervasively gathered information. To reach this goal, the framework takes the HDFS as the basis and adds three extra modules for merging and indexing the small files during the read/write operations in pervasive applications are performed. To further facilitate this process, a new ancillary namenode is also optionally installed to store the index table. With this optimization, SHAstor can not only optimize the small-writes, but also scale out with the number of datanodes to improve the performance of pervasive applications.
机译:众所周知,通常在普遍的计算中创建和访问小文件,其中通过与遇到的对象链接,使用有限的资源处理信息。和Hadoop框架,作为事实上的大数据处理平台虽然在实践中非常流行,无法有效地处理小文件。在本文中,我们提出了一种基于HDFS的存储框架,名为Shastor,提高了普及计算范例的小写处理的吞吐量。与经典的HDFS相比,这种方法的本质是将传入的小写合并到客户端或服务器端的大块数据中,然后将其存储为框架中的一个大文件。结果,这可以大大减少用于处理普遍收集信息的小文件的数量。为了实现这一目标,该框架将HDFS作为基础,并在执行普遍应用程序中读取/写入操作期间,在普遍应用程序中进行合并和索引小文件,添加了三个额外的模块。为了进一步促进此过程,还可选择新的辅助NameNode来存储索引表。通过这种优化,Shastor不仅可以优化小写,还可以与DataNode数量缩放,以提高普及应用的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号