Journal: Concurrency and Computation: Practice and Experience

Nonintrusive collection and management of data provenance in scientific workflows

Abstract

In this paper, we introduce an efficient mechanism to collect, store, and retrieve data provenance information in workflows of multiphysics simulations. Using notifications, we collect information about workflow events nonintrusively during workflow execution. Combining these events with workflow structure information, which is constant across every execution of a workflow, yields the data provenance information for a specific run of the workflow. This information is structured as a graph that represents workflow events according to their causal dependencies. We store this graph in a graph database and use the database's traversal framework to retrieve data provenance information efficiently by traversing backwards from a data object to every workflow event that is part of its provenance. Finally, we integrate data provenance information with the semantics of workflow services to provide complete and meaningful data provenance information.
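
As a rough illustration of the backward traversal described in the abstract, the sketch below uses a plain in-memory dictionary in place of a graph database. The event names, the `caused_by` field, and the helper functions are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from collections import deque


def build_provenance_graph(events):
    """Map each event id to the ids of the events it causally depends on."""
    return {e["id"]: list(e["caused_by"]) for e in events}


def provenance_of(graph, start):
    """Breadth-first backward traversal from `start`.

    Returns every event reachable through causal-dependency edges,
    in the order it was first visited (excluding `start` itself).
    """
    seen = {start}
    order = []
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for parent in graph.get(node, []):
            if parent not in seen:
                seen.add(parent)
                order.append(parent)
                queue.append(parent)
    return order


# Hypothetical events from one workflow run (assumed names, not from the paper).
events = [
    {"id": "mesh_input", "caused_by": []},
    {"id": "solver_invoked", "caused_by": ["mesh_input"]},
    {"id": "params_input", "caused_by": []},
    {"id": "solver_completed", "caused_by": ["solver_invoked", "params_input"]},
    {"id": "result_data", "caused_by": ["solver_completed"]},
    {"id": "unrelated_event", "caused_by": []},
]

graph = build_provenance_graph(events)
lineage = provenance_of(graph, "result_data")
```

Here `lineage` holds only the events that `result_data` causally depends on; events outside that chain, such as `unrelated_event`, are never visited. A graph database would play the role of `graph`, with its traversal framework replacing the hand-written loop.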