Representing meaningful provenance in scientific workflow systems.

Abstract

Data provenance has been a major issue for scientific workflow systems over the past several years. Provenance of data can be considered to take two forms, originally proposed by Buneman et al. Where-provenance relates to the origination of the data, such as its original source. In contrast, why-provenance relates to the lifetime of the data and all of the operations and changes made to the data over its span of use.

The business community has been the main source of inspiration for how to handle provenance, but it has not considered the primarily data-centric side of scientific workflows. Whereas scientific workflows focus on data, business workflows are generally based on control flow.

The database community has also tried to address the issues relating to data provenance. Approaches that try to solve these issues, such as secondary databases and inversions, need modification because their assumptions differ when they are applied to systems that are not database-oriented.

Our focus is on modeling data provenance not just at the user level, but also at the intermediate and data levels. By focusing on the workflow where it relates to the underlying structure, we can create a more compact representation of provenance at the data level, a more formalized provenance representation at the intermediate level, and a more intuitive display of data provenance at the user level. We also aim to define more clearly what data provenance is when applied to pipelined workflow systems. This approach will allow the data provenance generated in research to be more compact, formal, and usable.

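Bibliographic record

  • Author

    Bryant, Miranda A.

  • Author affiliation

    University of Wyoming, Computer Science

  • Degree-granting institution: University of Wyoming, Computer Science
  • Subject: Computer Science
  • Degree: M.S.
  • Year: 2007
  • Pages: 68 p.
  • Total pages: 68
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification: Automation technology, computer technology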
