
Distributed Popularity Based Replica Placement in Data Grid Environments



Abstract

Data grids support distributed data-intensive applications that need to access massive datasets stored around the world. Efficient access to such datasets is hindered by the high latencies of wide-area networks. To speed up access, files can be replicated so that a user can read a nearby replica. Replication also improves availability, fault tolerance, and scalability, and reduces bandwidth use. Because a grid environment is dynamic, resource availability, network latency, and user requests may change over time. To address these issues, a dynamic replica placement strategy that adapts to changing behaviour is needed. In this paper, we introduce a highly distributed replica placement algorithm for hierarchical data grids. Our algorithm exploits data access histories to identify popular files and determines optimal replication locations, improving access performance by minimizing replication overhead (access and update) under a given traffic pattern. The problem is formulated using dynamic programming. We evaluate our algorithm using the OptorSim simulator and find that it offers shorter execution time and reduced bandwidth consumption compared to other dynamic replica placement methods.
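The abstract does not give the algorithm itself, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a tree-shaped grid whose root holds the master copy, a hypothetical cost model (each read climbs to the nearest replica at one unit per hop, and each replica costs `update_rate` times its depth to keep current), and a centralized dynamic program rather than the distributed one the paper describes. The hierarchy, site names, and helper names are all invented for illustration.

```python
from collections import Counter
from functools import lru_cache

# Hypothetical hierarchy (child -> parent); "root" holds the master copy.
PARENT = {"region_a": "root", "region_b": "root",
          "site_1": "region_a", "site_2": "region_a", "site_3": "region_b"}


def children_of(parent):
    """Invert a child -> parent map into parent -> [children]."""
    kids = {}
    for child, par in parent.items():
        kids.setdefault(par, []).append(child)
    return kids


def popular_files(access_log, top_n):
    """Rank files by request count in an access history of (site, file) pairs."""
    counts = Counter(name for _site, name in access_log)
    return [name for name, _count in counts.most_common(top_n)]


def place_replicas(parent, reads, update_rate, root="root"):
    """Return (min_cost, replica_nodes) for one file under the assumed cost model."""
    kids = children_of(parent)

    @lru_cache(maxsize=None)
    def best(node, dist, depth):
        # dist: hops from this node up to the nearest replica above it.
        # Option 1: no replica here, local reads travel `dist` hops.
        keep_cost, keep_set = reads.get(node, 0) * dist, frozenset()
        # Option 2: replica here, local reads are free but updates cost extra.
        place_cost, place_set = update_rate * depth, frozenset([node])
        for child in kids.get(node, []):
            cost, chosen = best(child, dist + 1, depth + 1)
            keep_cost, keep_set = keep_cost + cost, keep_set | chosen
            cost, chosen = best(child, 1, depth + 1)
            place_cost, place_set = place_cost + cost, place_set | chosen
        if place_cost < keep_cost:
            return place_cost, place_set
        return keep_cost, keep_set

    total, replicas = 0, frozenset()
    for child in kids.get(root, []):
        cost, chosen = best(child, 1, 1)
        total, replicas = total + cost, replicas | chosen
    return total, replicas


# Usage: find the most requested file, then place replicas for it.
access_log = ([("site_1", "f1")] * 10 + [("site_2", "f1")]
              + [("site_3", "f1")] * 2 + [("site_3", "f2")])
for name in popular_files(access_log, top_n=1):
    reads = Counter(site for site, f in access_log if f == name)
    print(name, place_replicas(PARENT, reads, update_rate=3))
```

Under this toy model, a replica is worth placing only where the read traffic it absorbs outweighs its update cost, which is the access-versus-update trade-off the abstract refers to.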
