首页> 外文会议>IEEE International Conference on e-Science >A framework for scientific workflow reproducibility in the cloud
【24h】

A framework for scientific workflow reproducibility in the cloud

机译:云中科学工作流程可再现性的框架

获取原文

摘要

Workflow is a well-established means by which to capture scientific methods in an abstract graph of interrelated processing tasks. The reproducibility of scientific workflows is therefore fundamental to reproducible e-Science. However, the ability to record all the required details so as to make a workflow fully reproducible is a long-standing problem that is very difficult to solve. In this paper, we introduce an approach that integrates system description, source control, container management and automatic deployment techniques to facilitate workflow reproducibility. We have developed a framework that leverages this integration to support workflow execution, re-execution and reproducibility in the cloud and in a personal computing environment. We demonstrate the effectiveness of our approach by examining various aspects of repeatability and reproducibility on real scientific workflows. The framework allows workflow and task images to be captured automatically, which improves not only repeatability but also runtime performance. It also gives workflows portability across different cloud environments. Finally, the framework can also track changes in the development of tasks and workflows to protect them from unintentional failures.
机译:工作流是一种完善的方法,通过它可以在相互关联的处理任务的抽象图中捕获科学方法。因此,科学工作流程的可重现性是可重现电子科学的基础。但是,记录所有必需细节以使工作流完全可重现的能力是一个长期存在的问题,很难解决。在本文中,我们介绍了一种将系统描述,源代码控制,容器管理和自动部署技术集成在一起的方法,以促进工作流的可重复性。我们已经开发了一个框架,该框架利用此集成来支持工作流执行,在云中以及在个人计算环境中的重新执行和可再现性。我们通过检查真实科学工作流程中可重复性和可再现性的各个方面,证明了我们方法的有效性。该框架允许自动捕获工作流和任务图像,这不仅提高了可重复性,而且还提高了运行时性能。它还为工作流提供了跨不同云环境的可移植性。最后,该框架还可以跟踪任务和工作流开发中的更改,以保护它们免受意外失败的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号