【24h】

DiSK: A distributed shared disk cache for HPC environments

机译:DiSK:适用于HPC环境的分布式共享磁盘缓存

获取原文

摘要

Data movement within high performance environments can be a large bottleneck to the overall performance of programs. With the addition of continuous storage and usage of older data, the back end storage is becoming a larger problem than the improving network and computational nodes. This has led us to develop a Distributed Shared Disk Cache, DiSK, to reduce the dependence on these back end storage systems. With DiSK requested files will be distributed across nodes in order to reduce the amount of requests directed to the archives. DiSK has two key components. One is a Distributed Metadata Management, DIMM, scheme that allows a centralized manager to access what data is available in the system. This is accomplished through the use of a counter-based bloomfilter with locality checks in order to reduce false positives and false negatives. The second component is a method of replication called Differentiable Replication, DiR. The novelty of DiR is that the requirements of the files and capabilities of underlying nodes are taken into consideration for replication. This allows for a varying degree of replication depending on the file. This customization of DiSK yields better performance than the conventional archive system.
机译:高性能环境中的数据移动可能成为程序整体性能的很大瓶颈。随着连续存储的增加和旧数据的使用,与改进网络和计算节点相比,后端存储已成为一个更大的问题。这导致我们开发了分布式共享磁盘缓存DiSK,以减少对这些后端存储系统的依赖。使用DiSK,请求的文件将分布在各个节点上,以减少定向到存档的请求量。 DiSK具有两个关键组成部分。一种是分布式元数据管理DIMM方案,它允许集中管理器访问系统中可用的数据。这是通过使用基于计数器的Bloomfilter进行局部检查来实现的,以减少误报和误报。第二个组件是一种称为差异复制DiR的复制方法。 DiR的新颖之处在于复制时要考虑文件的需求和底层节点的功能。这允许根据文件的不同程度的复制。与常规存档系统相比,DiSK的这种自定义产生了更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号