IEEE International Conference on e-Science

Interactive provenance summaries for reproducible science

Abstract

Recorded provenance facilitates reproducible science. Provenance metadata can help determine how data may have been transformed, processed, and derived from original sources. While provenance is crucial for verification and validation, there remains the question of granularity: the level of detail at which provenance data must be presented to a user, especially when conducting reproducible science. When data are reproduced successfully, the need for detailed provenance is minimal, and a summary that captures the essence of the recorded provenance suffices. However, when data are not reproduced correctly, users want to drill down quickly into fine-grained provenance to understand the causes of failure. In this paper, we describe a drill-up/drill-down method for exploring provenance traces. The drill-up method summarizes a trace by grouping nodes and edges that have the same derivation histories, and it preserves provenance data-flow semantics. The drill-down method compares summary groups and ranks the groups that may contain information about the errors. Both methods are implemented efficiently using lightweight data structures so as to be suitable for reproducible science. We conduct a thorough experimental analysis to show how the operators perform in compressing and expanding real provenance graphs.
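
The drill-up idea described in the abstract can be illustrated with a small sketch. The abstract does not define "same derivation histories" precisely, so the code below assumes one plausible reading: two nodes fall into the same summary group when they share a label and their parents fall into the same set of groups. The function name `drill_up`, the input format, and the example trace are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def drill_up(labels, edges):
    """Summarize a provenance DAG by grouping nodes with equal derivation history.

    labels: dict mapping every node to its label (e.g. an activity or data type)
    edges:  iterable of (source, derived) pairs
    Assumption (not from the paper): a node's history is its label plus the
    set of groups its parents belong to.
    """
    edges = list(edges)
    parents = defaultdict(set)
    for src, dst in edges:
        parents[dst].add(src)

    # Visit nodes so that every parent is grouped before its children.
    order = TopologicalSorter({n: parents[n] for n in labels}).static_order()

    group_ids = {}  # history signature -> group id
    group_of = {}   # node -> group id
    for node in order:
        signature = (labels[node], frozenset(group_of[p] for p in parents[node]))
        group_of[node] = group_ids.setdefault(signature, len(group_ids))

    members = defaultdict(list)
    for node, gid in group_of.items():
        members[gid].append(node)

    # An edge between two groups exists whenever any of their members are
    # connected, so the data flow of the original trace remains visible.
    summary_edges = {(group_of[s], group_of[d]) for s, d in edges}
    return dict(members), summary_edges, group_of


# Hypothetical trace: two inputs are aligned independently, then merged.
labels = {
    "in1": "input", "in2": "input",
    "a1": "align", "a2": "align",
    "o1": "aligned", "o2": "aligned",
    "m": "merge",
}
edges = [
    ("in1", "a1"), ("in2", "a2"),
    ("a1", "o1"), ("a2", "o2"),
    ("o1", "m"), ("o2", "m"),
]

members, summary_edges, group_of = drill_up(labels, edges)
for gid, nodes in sorted(members.items()):
    print(gid, sorted(nodes))
```

In this example the seven nodes collapse into four groups linked by three summary edges. Drilling down would amount to expanding a group back into its members; the ranking step mentioned in the abstract is not sketched here because the abstract gives no detail on how groups are scored.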