首页> 外文期刊>Data & Knowledge Engineering >OPQL: Querying scientific workflow provenance at the graph level
【24h】

OPQL: Querying scientific workflow provenance at the graph level

机译:OPQL:在图形级别查询科学工作流出处

获取原文
获取原文并翻译 | 示例

摘要

Provenance has become increasingly important in scientific workflows to understand, verify, and reproduce the result of scientific data analysis. Most existing systems store provenance data in provenance stores with proprietary provenance data models and conduct query processing over the physical provenance storages using query languages, such as SQL, SPARQL, and XQuery, which are closely coupled to the underlying storage strategies. Querying provenance at such low level leads to poor usability of the system: a user needs to know the underlying schema to formulate queries; if the schema changes, queries need to be reformulated; and queries formulated for one system will not run in another system. In this paper, we present OPQL, a provenance query language that enables the querying of provenance directly at the graph level. An OPQL query takes a provenance graph as input and produces another provenance graph as output. Therefore, OPQL queries are not tightly coupled to the underlying provenance storage strategies. Our main contributions are: (ⅰ) we design OPQL, including six types of graph patterns, a provenance graph algebra, and OPQL syntax and semantics, that supports querying provenance at the graph level; (ⅱ) we implement OPQL using a Web service via our OPMProv system; therefore, users can invoke the Web service to execute OPQL queries in a provenance browser, called OPMProVis. The result of OPQL queries is displayed as a provenance graph in OPMProVis. An experimental study is conducted to evaluate the feasibility and performance of OPMProv on OPQL provenance querying.
机译:在理解,验证和重现科学数据分析结果的过程中,出处在科学工作流程中变得越来越重要。大多数现有系统将具有专有来源数据模型的来源数据存储在来源存储中,并使用与底层存储策略紧密耦合的查询语言(例如SQL,SPARQL和XQuery)对物理来源存储进行查询处理。在如此低的级别查询出处会导致系统的可用性很差:用户需要了解底层架构才能制定查询;如果架构更改,则需要重新构造查询;为一个系统制定的查询将不会在另一系统中运行。在本文中,我们介绍了OPQL,一种出处查询语言,可以直接在图级别上查询出处。 OPQL查询将一个出处图作为输入,并生成另一个出处图作为输出。因此,OPQL查询不会与基础的来源存储策略紧密结合。我们的主要贡献是:(ⅰ)我们设计了OPQL,包括六种类型的图形模式,一个源图形代数以及OPQL语法和语义,它们支持在图形级查询源。 (ⅱ)我们通过OPMProv系统使用Web服务来实现OPQL;因此,用户可以调用Web服务在称为OPMProVis的出处浏览器中执行OPQL查询。 OPQL查询的结果在OPMProVis中显示为出处图。进行了一项实验研究,以评估OPMProv在OPQL来源查询中的可行性和性能。

著录项

  • 来源
    《Data & Knowledge Engineering》 |2013年第11期|37-59|共23页
  • 作者单位

    Department of Computer Science, Wayne State University, Detroit. Ml 48202, USA;

    Department of Computer Science, Wayne State University, Detroit. Ml 48202, USA;

    Department of Computer Science, University of Texas-Pan American, Edinburg, TX 78539, USA;

    Department of Computer Science, Wayne State University, Detroit. Ml 48202, USA;

    Department of Computer Science, Wayne State University, Detroit. Ml 48202, USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    OPQL; Provenance query language; Scientific workflow provenance;

    机译:OPQL;来源查询语言;科学的工作流程出处;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号