A Noisy 10GB Provenance Database

机译：嘈杂的10GB来源数据库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Provenance of scientific data is a key piece of the metadata record for the data's ongoing discovery and reuse. Provenance collection systems capture provenance on the fly, however, the protocol between application and provenance tool may not be reliable. Consequently, the provenance record can be partial, partitioned, and simply inaccurate. We use a workflow emulator that models faults to construct a large 10GB database of provenance that we know is noisy (that is, has errors). We discuss the process of generating the provenance database, and show early results on the kinds of provenance analysis enabled by the large provenance.

机译：科学数据的来源是元数据记录中用于数据的持续发现和重用的关键部分。种源收集系统可以实时捕获种源，但是，应用程序和种源工具之间的协议可能并不可靠。因此，出处记录可能是不完整的，不完整的，不完整的。我们使用对故障进行建模的工作流仿真器来构造一个大型的10GB起源数据库，我们知道该数据库是嘈杂的（即有错误）。我们讨论了出处数据库的生成过程，并展示了由大量出处所支持的各种出处分析的早期结果。

著录项

来源
《International conference on business process management;BPM 2011》|2011年|p.370-381|共12页
会议地点
作者
You-Wei Cheah; Beth Plale; Joey Kendall-Morwick; David Leake; Lavanya Ramakrishnan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电子贸易、网上贸易;
关键词
data provenance; scientific workflows; provenance quality; case-based reasoning;

机译：数据来源;科学的工作流程;种源质量;基于案例的推理;

相似文献

外文文献
中文文献
专利

1. Reassessment of WD-XRF method for obsidian provenance shareable databases [J] . Acquafredda Pasquale, Muntoni Italo M., Pallara Mauro Quaternary International . 2018,第PTaA期

机译：WD-XRF方法对黑曜石来源共享数据库的重新评估
2. Italian tomato-based products authentication by multi-element approach: A mineral elements database to distinguish the domestic provenance [J] . Fragni Rosaria, Trifiro Antonio, Nucci Ada, Food Control . 2018,第期

机译：基于意大利番茄的产品通过多元素方法进行身份验证：矿物元素数据库区分国内源
3. Improving provenance data in natural history collection databases [J] . Rosenberg Gary, Khoo Michael American malacological bulletin . 2018,第2期

机译：在自然历史集合数据库中提高出处数据
4. Twitter Data Modelling and Provenance Support for Key-Value Pair Databases [C] . Asma Rani, Navneet Goyal, Shashi K. Gadia Australasian Database Conference . 2021

机译：键值对数据库的Twitter数据建模和出处支持
5. Online efficient and effective search in large and noisy sequence databases [D] . Kotsifakos, Alexios 2014

机译：大型且嘈杂的序列数据库中的在线高效搜索
6. A geochemical characterization of lead ores in China: An isotope database for provenancing archaeological materials [O] . Yiu-Kang Hsu, Benjamin J. Sabatini 2012

机译：中国铅矿石的地球化学特征：考古资料来源的同位素数据库
7. Database "Climate parameters of seed provenances of pine in northern eurasia" [O] . Elena Parfenova 2020

机译：数据库“欧亚北部杉木种子种子的气候参数”
8. Noisy 10GB Provenance Database. [R] . Cheah, Y. W., Plale, B., Kendall-Morwick, J., 2013

机译：嘈杂的10GB provenance数据库。

A Noisy 10GB Provenance Database

摘要

著录项

相似文献

相关主题

期刊订阅