Journal: Brazilian Computer Society. Journal
Reliable management of checkpointing and application data in opportunistic grids

Abstract

Opportunistic computational grids use idle processor cycles from shared machines to enable the execution of long-running parallel applications. Besides computational power, these applications may also consume and generate large amounts of data, requiring an efficient data storage and management infrastructure. In this article, we present an integrated middleware infrastructure that enables the use of not only idle processor cycles, but also unused disk space of shared machines. Our middleware enables the reliable distributed storage of application data in the shared machines in a redundant and fault-tolerant way. A checkpointing-based mechanism monitors the execution of parallel applications, saves periodic checkpoints in the shared machines, and, in case of node failures, supports application migration across heterogeneous grid nodes. We evaluate the feasibility of our middleware using experiments and simulations. Our evaluation shows that the proposed middleware substantially improves the reliability of grid data management while imposing a low performance overhead.
