首页> 外文学位 >Composing and conveying lineage metadata for environmental science research computing.
【24h】

Composing and conveying lineage metadata for environmental science research computing.

机译:组成和传达沿袭元数据以进行环境科学研究计算。

获取原文
获取原文并翻译 | 示例

摘要

Although the online propagation of environmental and Earth science data is increasing, the record of its origins and processing history---its lineage---is often absent, inadequate or irretrievable by potential users. This work is meant to address the following research question: How can organizations that perform data processing to support environmental and Earth science research attain the ability to compose (arrange in proper form) and convey (communicate to others) lineage metadata for the data products they create?; Lineage retrieval requires the capability to assemble a retrospective view of workflow using extant metadata. A review of lineage-related research from the past two decades provides a framework to clarify the architecture of previous prototypes, and direct the architectural design of new systems. Through experience with example workflows for calculating oceanic primary production, we explore workflow and metadata for script-based data processing. We investigate a prototype lineage server that introduces a level of indirection for the metadata objects presented in lineage graphs.; A significant problem facing potential data consumers is lineage retrieval for the results of data processing that span multiple research groups or organizations. The linchpin data products that connect the workflow invocations from different organizations as the links of a data processing chain are the key to maintaining the continuity of the complementary, retrospective lineage. We propose using the Resource Description Framework (RDF) as a standard, portable format for summarizing the lineage metadata of workflow invocations.; This work investigates alternatives to the Earth System Science Workbench proposals for composing and conveying the lineage of satellite-derived data products. Research contributions include: developing a workflow and metadata model to facilitate composing both fundamental and lineage metadata for script-based data processing; proposing the concept of a standalone lineage server to provide additional flexibility for delivering the metadata of objects in the lineage; and investigating the use of embedding or linking RDF/XML lineage metadata within the fundamental metadata for a data product to connect the links of the "lineage chain," that is, a chain of workflow invocations, across organizations.
机译:尽管环境和地球科学数据的在线传播正在增加,但潜在用户通常缺少,不足或无法获取其起源和加工历史记录(即沿袭)。这项工作旨在解决以下研究问题:进行数据处理以支持环境和地球科学研究的组织如何才能获得为其数据产品撰写(安排适当的形式)并传达(与他人交流)沿袭元数据的能力。创建?;沿袭检索要求能够使用现有元数据来组合工作流的回顾视图。对过去二十年中与谱系相关的研究的回顾提供了一个框架,以阐明以前的原型的体系结构,并指导新系统的体系结构设计。通过对用于计算海洋一次生产的示例工作流的经验,我们探索了用于基于脚本的数据处理的工作流和元数据。我们研究了一个原型沿袭服务器,该服务器为沿袭图中呈现的元数据对象引入了一个间接级别。潜在数据使用者面临的一个重要问题是沿袭检索跨多个研究小组或组织的数据处理结果。关键数据产品将来自不同组织的工作流调用连接起来,作为数据处理链的链接,是保持互补性,追溯性血统的连续性的关键。我们建议使用资源描述框架(RDF)作为标准的可移植格式,以汇总工作流调用的沿袭元数据。这项工作研究了地球系统科学工作台提案的替代方案,这些方案用于组成和传达卫星数据产品的血统。研究成果包括:开发工作流和元数据模型,以促进基本和沿袭元数据的组合,以用于基于脚本的数据处理;提出独立谱系服务器的概念,以提供额外的灵活性来传递谱系中对象的元数据;以及研究在数据产品的基本元数据中嵌入或链接RDF / XML谱系元数据,以跨组织连接“谱系链”(即工作流程调用链)的链接。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号