首页> 外文会议>Cluster Computing and the Grid, 2009. CCGRID '09 >File Clustering Based Replication Algorithm in a Grid Environment
【24h】

File Clustering Based Replication Algorithm in a Grid Environment

机译:网格环境中基于文件聚类的复制算法

获取原文
获取原文并翻译 | 示例

摘要

Replication in grid file systems can significantly improve I/O performance of data-intensive applications. However, most of existing replication techniques apply to individual files, which may introduce inefficient replication overheads for a large number of files. We propose a file clustering based replication algorithm for grid file systems. Our algorithm groups files according to a relationship of simultaneous accesses between files and stores replicas of the clustered files into storage nodes, to satisfy expected most of future read access times to the clustered files and replication times for individual files being minimized under the given storage capacity limitation. Our experiments on a given grid environment, 20 nodes of 5 sites, suggest that the proposed algorithm achieves accurate file clustering and efficient replica management; our clustering policy with the file cluster size limit of 5120 MB and the storage capacity limit for replicas of 10240 MB exhibits 1.58 times efficiency than the policy that never groups related files. The results also indicate that the overheads required for introducing our algorithm significantly affect I/O performance of running applications.
机译:网格文件系统中的复制可以显着提高数据密集型应用程序的I / O性能。但是,大多数现有复制技术适用于单个文件,这可能会导致大量文件的复制开销低下。我们提出了一种基于文件聚类的网格文件系统复制算法。我们的算法根据文件之间同时访问的关系对文件进行分组,并将集群文件的副本存储到存储节点中,以满足未来对集群文件的预期大多数读取访问时间,并在给定存储容量下将单个文件的复制时间降至最低局限性。我们在给定的网格环境(5个站点的20个节点)上进行的实验表明,该算法可实现准确的文件聚类和有效的副本管理;我们的群集策略的文件群集大小限制为5120 MB,副本的存储容量限制为10240 MB,其效率是从不对相关文件进行分组的策略的1.58倍。结果还表明,引入我们的算法所需的开销会显着影响正在运行的应用程序的I / O性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号