Linking multiple workflow provenance traces for interoperable collaborative science

Abstract

Scientific collaboration increasingly involves data sharing between separate groups. We consider a scenario where data products of scientific workflows are published and then used by other researchers as inputs to their workflows. For proper interpretation, shared data must be complemented by descriptive metadata. We focus on provenance traces, a prime example of such metadata which describes the genesis and processing history of data products in terms of the computational workflow steps. Through the reuse of published data, virtual, implicitly collaborative experiments emerge, making it desirable to compose the independently generated traces into global ones that describe the combined executions as single, seamless experiments. We present a model for provenance sharing that realizes this holistic view by overcoming the various interoperability problems that emerge from the heterogeneity of workflow systems, data formats, and provenance models. At the heart lie (i) an abstract workflow and provenance model in which (ii) data sharing becomes itself part of the combined workflow. We then describe an implementation of our model that we developed in the context of the Data Observation Network for Earth (DataONE) project and that can “stitch together” traces from different Kepler and Taverna workflow runs. It provides a prototypical framework for seamless cross-system, collaborative provenance management and can be easily extended to include other systems. Our approach also opens the door to new ways of workflow interoperability not only through often elusive workflow standards but through shared provenance information from public repositories.
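The stitching idea in the abstract can be illustrated with a minimal sketch. It is not the DataONE implementation: the edge-set representation of a trace, the helper names `make_trace`, `stitch`, and `lineage`, and the example step and data names are all assumptions made for illustration. Each trace is a set of dependency edges, and each act of data sharing is added as an explicit node so that it becomes part of the combined workflow.

```python
from collections import defaultdict


def make_trace(system, steps):
    """Build a trace as a set of (source, destination) dependency edges.

    `steps` is a list of (step_name, inputs, outputs). Node names are
    prefixed with the (hypothetical) system name to keep traces disjoint.
    """
    edges = set()
    for step, inputs, outputs in steps:
        node = f"{system}:{step}"
        for data in inputs:
            edges.add((f"{system}:{data}", node))   # step used data
        for data in outputs:
            edges.add((node, f"{system}:{data}"))   # step generated data
    return edges


def stitch(traces, shares):
    """Union independent traces and add one explicit share node per link.

    `shares` is a list of (published_item, downloaded_item) pairs; how
    these pairs are discovered is outside the scope of this sketch.
    """
    combined = set().union(*traces)
    for published, downloaded in shares:
        share = f"share({published}->{downloaded})"
        combined.add((published, share))
        combined.add((share, downloaded))
    return combined


def lineage(edges, target):
    """Return every node that `target` transitively depends on."""
    parents = defaultdict(set)
    for src, dst in edges:
        parents[dst].add(src)
    seen, stack = set(), [target]
    while stack:
        for parent in parents[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


# Two independently recorded runs (illustrative step and data names).
kepler = make_trace("kepler", [("clean", ["raw"], ["cleaned"])])
taverna = make_trace("taverna", [("plot", ["table"], ["figure"])])

# The published Kepler output was reused as the Taverna input.
combined = stitch([kepler, taverna], [("kepler:cleaned", "taverna:table")])
```

With the traces stitched, `lineage(combined, "taverna:figure")` reaches back to `kepler:raw`, whereas the Taverna trace alone stops at `taverna:table`.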

Bibliographic details

Source: 2010 5th Workshop on Workflows in Support of Large-Scale Science, 2010, pp. 1-8 (8 pages)
Original format: PDF
Chinese Library Classification: General issues

Similar literature

1. Linked provenance data: A semantic Web-based approach to interoperable workflow traces [J]. Li Ding, James Michaelis, Jim McCusker. Future Generation Computer Systems, 2011, no. 6.
2. P-PIF: a ProvONE provenance interoperability framework for analyzing heterogeneous workflow specifications and provenance traces [J]. Ajinkya Prabhune, Aaron Zweig, Rainer Stotzka. Distributed and Parallel Databases, 2018, no. 1.
3. An agent-based approach for capturing and linking provenance in geoscience workflows [J]. Narock Tom, Yoon Victoria. Computers & Geosciences, 2015, no. juna.
4. Linking multiple workflow provenance traces for interoperable collaborative science [C]. Workshop on Workflows in Support of Large-Scale Science, 2010.
5. Provenance Management for Collaborative Data Science Workflows [D]. Miao, Hui. 2018.
6. Sharing interoperable workflow provenance: A review of best practices and their practical application in CWLProv [O]. Farah Zaib Khan, Stian Soiland-Reyes, Richard O Sinnott. 2019.
7. Linking Multiple Workflow Provenance Traces for Interoperable Collaborative Science [O]. Paolo Missier, Carole Goble, Saumen Dey. 2013.
8. Computing Science: D-PROV: Extending the PROV Provenance Model with Workflow Structure [R]. Missier, P., Key, S., Belhajjame, K. 2013.
