首页> 外文会议>IEEE International Conference on e-Science >PRUNE: A preserving run environment for reproducible scientific computing
【24h】

PRUNE: A preserving run environment for reproducible scientific computing

机译:PRUNE:保留运行环境以进行可重复的科学计算

获取原文

摘要

Computing as a whole suffers from a crisis of reproducibility. Programs executed in one context are astonishingly hard to reproduce in another context, resulting in wasted effort by people and general distrust of results produced by computer. The root of the problem lies in the fact that every program has implicit dependencies on data and execution environment which are rarely understood by the end user. To address this problem, we present PRUNE, the Preserving Run Environment. In PRUNE, every task to be executed is wrapped in a functional interface and coupled with a strictly defined environment. The task is then executed by PRUNE rather than the user to ensure reproducibility. As a scientific workflow evolves in PRUNE, a growing but immutable tree of derived data is created. The provenance of every item in the system can be precisely described, facilitating sharing and modification between collaborating researchers, along with efficient management of limited storage space. We present the user interface and the initial prototype of PRUNE, and demonstrate its application in matching records and comparing surnames in U.S. Censuses.
机译:整个计算遭受可再现性危机。在一种情况下执行的程序非常难以在另一种情况下进行复制,从而导致人们的工作浪费和对计算机产生的结果的普遍不信任。问题的根源在于每个程序对数据和执行环境都有隐式的依赖关系,而最终用户很少了解这些依赖关系。为了解决这个问题,我们提出了PRUNE,即保留运行环境。在PRUNE中,每个要执行的任务都包装在功能接口中,并与严格定义的环境结合在一起。然后,该任务由PRUNE而不是用户执行,以确保可重复性。随着科学工作流程在PRUNE中的发展,将创建一个不断增长的但不可变的派生数据树。可以精确描述系统中每个项目的出处,从而促进合作研究人员之间的共享和修改,以及对有限存储空间的有效管理。我们介绍了PRUNE的用户界面和初始原型,并演示了其在匹配记录和比较美国人口普查中的姓氏方面的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号