首页> 外文会议>Provenance and annotation of data and processes >Understanding Collaborative Studies through Interoperable Workflow Provenance
【24h】

Understanding Collaborative Studies through Interoperable Workflow Provenance

机译:通过可互操作的工作流出处了解协作研究

获取原文
获取原文并翻译 | 示例

摘要

The provenance of a data product contains information about how the product was derived, and is crucial for enabling scientists to easily understand, reproduce, and verify scientific results. Currently, most provenance models are designed to capture the provenance related to a single run, and mostly executed by a single user. However, a scientific discovery is often the result of methodical execution of many scientific workflows with many datasets produced at different times by one or more users. Further, to promote and facilitate exchange of information between multiple workflow systems supporting provenance, the Open Provenance Model (OPM) has been proposed by the scientific workflow community. In this paper, we describe a new query model that captures implicit user collaborations. We show how this model maps to OPM and helps to answer collaborative queries, e.g., identifying combined workflows and contributions of users collaborating on a project based on the records of previous workflow executions. We also adopt and extend the high-level Query Language for Provenance (QLP) with additional constructs, and show how these extensions allow non-expert users to express collaborative provenance queries against this model easily and concisely. Furthermore, we adopt the Provenance Challenge 3 (PC3) workflows as a collaborative and interoperable usecase scenario, where different stages of the workflow are executed in three different workflow environments -Kepler, Taverna, and WSVLAM. Through this usecase, we demonstrate how we can establish and understand collaborative studies through interoperable workflow provenance.
机译:数据产品的来源包含有关产品来源的信息,这对于使科学家能够轻松理解,复制和验证科学结果至关重要。当前,大多数出处模型都旨在捕获与单次运行相关的出处,并且大多由单个用户执行。但是,科学发现通常是许多科学工作流有条不紊地执行的结果,其中有一个或多个用户在不同时间生成了许多数据集。此外,为了促进和促进支持来源的多个工作流系统之间的信息交换,科学工作流社区已经提出了开放源模型(OPM)。在本文中,我们描述了一个捕获隐式用户协作的新查询模型。我们将展示此模型如何映射到OPM并帮助回答协作查询,例如,基于先前的工作流执行记录来识别组合的工作流以及在项目上进行协作的用户的贡献。我们还采用其他构架采用和扩展了高级起源查询语言(QLP),并展示了这些扩展如何使非专家用户可以轻松,简洁地表达针对此模型的协作起源查询。此外,我们采用了Provenance Challenge 3(PC3)工作流作为协作和可互操作的用例场景,其中在三个不同的工作流环境(Kepler,Taverna和WSVLAM)中执行工作流的不同阶段。通过这个用例,我们演示了如何通过可互操作的工作流来源来建立和理解协作研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号