Published in: IEEE Transactions on Parallel and Distributed Systems

HBA: Distributed Metadata Management for Large Cluster-Based Storage Systems


Abstract

An efficient, distributed scheme for file mapping (file lookup) is critical to decentralizing metadata management within a group of metadata servers. This paper presents a novel technique called HBA (Hierarchical Bloom filter Arrays) that maps filenames to the metadata servers holding their metadata. Each metadata server uses two levels of probabilistic arrays, namely Bloom filter arrays, with different levels of accuracy. One array, with lower accuracy, represents the distribution of the entire metadata and trades accuracy for significantly reduced memory overhead. The other array, with higher accuracy, caches partial distribution information and exploits the temporal locality of file access patterns. Both arrays are replicated to all metadata servers to support fast local lookups. We evaluate HBA through extensive trace-driven simulations and an implementation in Linux. Simulation results show our HBA design to be highly effective and efficient in improving the performance and scalability of file systems in clusters with 1,000 to 10,000 nodes (or super-clusters) and with data volumes at the petabyte scale or higher. Our implementation indicates that HBA can reduce the metadata operation time of a single-metadata-server architecture by a factor of up to 43.9 when the system is configured with 16 metadata servers.
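The two-level lookup described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class and function names, the SHA-256 double-hashing scheme, the filter sizes, and the rule that only a unique hit counts as an answer are all assumptions made for illustration. The abstract itself only states that a higher-accuracy array caches partial information and a lower-accuracy array covers all metadata.

```python
import hashlib


class BloomFilter:
    """Plain Bloom filter. num_bits and num_hashes are illustrative parameters."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for readability

    def _positions(self, key):
        # Assumed hashing scheme: double hashing over one SHA-256 digest.
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = 1

    def __contains__(self, key):
        # May report false positives, never false negatives.
        return all(self.bits[pos] for pos in self._positions(key))


class BloomFilterArray:
    """One Bloom filter per metadata server (hypothetical helper class)."""

    def __init__(self, num_servers, num_bits, num_hashes):
        self.filters = [BloomFilter(num_bits, num_hashes)
                        for _ in range(num_servers)]

    def add(self, server_id, filename):
        self.filters[server_id].add(filename)

    def hits(self, filename):
        # IDs of all servers whose filter claims to hold this filename.
        return [s for s, f in enumerate(self.filters) if filename in f]


def lookup(filename, hot_array, global_array):
    """Try the higher-accuracy cache array first, then the global array.

    Assumption: only a unique hit is trusted. Zero or multiple hits fall
    through to the next level, and finally to a "miss" that a real system
    would have to resolve some other way (for example, by asking all servers).
    """
    for level, array in (("hot", hot_array), ("global", global_array)):
        hits = array.hits(filename)
        if len(hits) == 1:
            return hits[0], level
    return None, "miss"
```

In this sketch the "hot" array would be given more bits per stored filename than the "global" array, which mirrors the accuracy-versus-memory trade-off the abstract describes. Because both arrays are local data structures, a lookup touches no other server unless it ends in a miss.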
