首页> 外文期刊>Journal of supercomputing >MBFS: a parallel metadata search method based on Bloomfilters using MapReduce for large-scale file systems
【24h】

MBFS: a parallel metadata search method based on Bloomfilters using MapReduce for large-scale file systems

机译:MBFS:基于Bloomfilters的并行元数据搜索方法,该方法使用MapReduce用于大型文件系统

获取原文
获取原文并翻译 | 示例
       

摘要

The metadata search is an important way to access and manage file systems. Many solutions have been proposed to tackle performance issue of metadata search. However, the existing solutions build a separate metadata index at the internal or external file system through the related data structure or database use semantics and event-notification method to construct the index structure, utilize the sampling-based method to conduct direct metadata search on the namespace, face problems of the high I/O overhead for maintaining consistency between metadata indexes and metadata, have enormous space overhead for metadata indexes storing and low accuracy of results and so on. To address these problems, this paper presents MBFS, a fast, accurate and lightweight metadata search method based on multi-dimensional Bloomfilters. We create a multi-dimensional Bloomfilter structure on the basis of the directory entry that can prune sub-trees to narrow the search scope of namespace. MBFS is capable of producing fast and accurate answers for a class of complex search over a file system after consuming a small number of disk accesses. MBFS residing in the file system does not need additional I/O overhead to maintain consistency. MBFS consists of Bloomfilters which are composed of bits, so it is a lightweight metadata search method that consumes marginal space overhead. Moreover, MBFS employs MapReduce for speeding up search under the environment of multiple metadata servers. Extensive experiments are conducted to prove the effectiveness of MBFS. The experimental results show that MBFS can achieve an excellent performance not only on the search latency, but also on the accuracy of results with low space and time overhead.
机译:元数据搜索是访问和管理文件系统的重要方式。已经提出了许多解决方案来解决元数据搜索的性能问题。然而,现有的解决方案通过相关的数据结构或数据库在内部或外部文件系统上构建单独的元数据索引,使用语义和事件通知方法来构造索引结构,利用基于采样的方法对数据库进行直接元数据搜索。命名空间面临着用于维护元数据索引和元数据之间一致性的高I / O开销的问题,具有用于元数据索引存储的巨大空间开销以及结果的准确性低等问题。为了解决这些问题,本文提出了MBFS,一种基于多维Bloomfilters的快速,准确和轻量级的元数据搜索方法。我们基于目录条目创建多维Bloomfilter结构,该结构可以修剪子树以缩小名称空间的搜索范围。 MBFS能够在消耗少量磁盘访问后为文件系统上的一类复杂搜索生成快速而准确的答案。驻留在文件系统中的MBFS不需要额外的I / O开销即可保持一致性。 MBFS由由位组成的Bloomfilters组成,因此它是一种轻量级的元数据搜索方法,消耗边际空间开销。此外,MBFS使用MapReduce在多个元数据服务器的环境下加快搜索速度。进行了广泛的实验以证明MBFS的有效性。实验结果表明,MBFS不仅可以在搜索等待时间上获得出色的性能,而且在空间和时间开销较小的情况下也可以实现结果的准确性。

著录项

  • 来源
    《Journal of supercomputing》 |2016年第8期|3006-3032|共27页
  • 作者单位

    Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China|Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China;

    Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China|Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China;

    Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China|Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China;

    Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China|Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China;

    Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China|Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China;

    Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China|Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China;

    Space Star Technol Co Ltd, Beijing 100086, Peoples R China;

    Space Star Technol Co Ltd, Beijing 100086, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Large-scale file systems; Parallel metadata search; Fast and accurate; Lightweight; Multi-dimensional Bloomfilters;

    机译:大型文件系统;并行元数据搜索;快速准确;轻量级;多维Bloomfilters;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号