首页> 外文期刊>Journal of network and computer applications >Combination of data replication and scheduling algorithm for improving data availability in Data Grids
【24h】

Combination of data replication and scheduling algorithm for improving data availability in Data Grids

机译:数据复制和调度算法相结合,可提高数据网格中的数据可用性

获取原文
获取原文并翻译 | 示例
           

摘要

Data Grid is a geographically distributed environment that deals with large-scale data-intensive applications. Effective scheduling in Grid can reduce the amount of data transferred among nodes by submitting a job to a node, where most of the requested data files are available. Data replication is another key optimization technique for reducing access latency and managing large data by storing data in a wisely manner. In this paper two algorithms are proposed, first a novel job scheduling algorithm called Combined Scheduling Strategy (CSS) that uses hierarchical scheduling to reduce the search time for an appropriate computing node. It considers the number of jobs waiting in queue, the location of required data for the job and the computing capacity of sites. Second a dynamic data replication strategy, called the Modified Dynamic Hierarchical Replication Algorithm (MDHRA) that improves file access time. This strategy is an enhanced version of Dynamic Hierarchical Replication (DHR) strategy. Data replication should be used wisely because the storage capacity of each Grid site is limited. Thus, it is important to design an effective strategy for the replication replacement. MDHRA replaces replicas based on the last time the replica was requested, number of access, and size of replica. It selects the best replica location from among the many replicas based on response time that can be determined by considering the data transfer time, the storage access latency, the replica requests that waiting in the storage queue and the distance between nodes. The simulation results demonstrate the proposed replication and scheduling strategies give better performance compared to the other algorithms.
机译:数据网格是一个地理分布的环境,可处理大规模的数据密集型应用程序。网格中的有效调度可以通过将作业提交给大多数请求的数据文件可用的节点来减少节点之间传输的数据量。数据复制是另一种关键的优化技术,可通过以明智的方式存储数据来减少访问延迟并管理大型数据。本文提出了两种算法,第一种是称为组合调度策略(CSS)的新颖的作业调度算法,该算法使用分层调度来减少适当计算节点的搜索时间。它考虑队列中等待的作业数,作业所需数据的位置以及站点的计算能力。第二种是动态数据复制策略,称为改进的动态分层复制算法(MDHRA),可以缩短文件访问时间。此策略是动态分层复制(DHR)策略的增强版本。数据复制应该明智地使用,因为每个Grid站点的存储容量是有限的。因此,设计有效的复制替换策略很重要。 MDHRA根据上次请求副本的时间,访问次数和副本的大小来替换副本。它根据可通过考虑数据传输时间,存储访问延迟,副本请求在存储队列中等待的副本以及节点之间的距离来确定的响应时间,从众多副本中选择最佳副本位置。仿真结果表明,与其他算法相比,所提出的复制和调度策略具有更好的性能。

著录项

  • 来源
    《Journal of network and computer applications》 |2013年第2期|711-722|共12页
  • 作者单位

    Department of Computer Science and Engineering, Birjand University of Technology, Postal Code 97175-569, Birjand, Iran;

    Department of Computer Science and Engineering, College of Electerical and Computer Engineering, Shiraz University, MollaSadra Avenue, Shiraz, Iran;

    Department of Computer Science and Engineering, College of Electrical and Computer Engineering, Birjand University, Avini Avenue, Birjand, Iran;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    data grid; data replication; job scheduling; simulation;

    机译:数据网格;数据复制;工作安排;模拟;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号