首页> 外文会议>IEEE International Symposium on Applied Computational Intelligence and Informatics >Evaluating the reproducibility cost of the scientific workflows
【24h】

Evaluating the reproducibility cost of the scientific workflows

机译:评估科学工作流程的可重复性成本

获取原文

摘要

In almost all research field scientific studies can be implemented by in silico experiments. They are modelled by scientific workflows which describes the data or control flow between the consecutive computational tasks. Since these experiments are data and compute intensive they need parallel and distributed infrastructures to be enacted (grids, clusters, clouds and supercomputers). The complexity of the infrastructures and the continuously changing environment faces us a big challenge in reproducibility, which is often needed for results sharing or for judging scientific claims in the scientists' community. The necessary parameters of reproducible workflows can be originated from different sources (infrastructural, third party, or related to the binaries), which may change or become unavailable during the process of re-execution. However in most cases the lack of the original parameters can be compensated by replacing, evaluating or simulating the value of the descriptors with some extra cost in order to make it reproducible. In this paper we give the expected cost of making a workflow reproducible or more precisely to determine the probability of making a workflow reproducible with more than a predefined cost C.
机译:在几乎所有的研究领域中,科学研究都可以通过计算机模拟实验来实现。它们由科学的工作流建模,该工作流描述了连续计算任务之间的数据或控制流。由于这些实验是数据和计算密集型的,因此需要制定并行和分布式的基础架构(网格,集群,云和超级计算机)。基础设施的复杂性和不断变化的环境给我们带来了可重复性方面的巨大挑战,这对于结果共享或判断科学家社区的科学主张通常是必需的。可重现的工作流的必要参数可以源自不同的来源(基础结构,第三方或与二进制文件有关),这些参数在重新执行过程中可能会更改或变得不可用。但是,在大多数情况下,可以通过用一些额外的成本替换,评估或模拟描述符的值来弥补原始参数的不足,以使其可重现。在本文中,我们给出了使工作流具有可复制性的预期成本,或更准确地说,是确定使工作流具有可复制性且超过预定成本C的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号