Conference: IEEE International Congress on Big Data

Big data provenance: Challenges, state of the art and opportunities

Abstract

The ability to track provenance is a key feature of scientific workflows, supporting data lineage and reproducibility. The challenges introduced by the volume, variety and velocity of Big Data also pose related challenges for the provenance and quality of Big Data, defined as veracity. The increasing size and variety of distributed Big Data provenance information bring new technical challenges and opportunities throughout the provenance lifecycle, including recording, querying, sharing and utilization. This paper discusses the challenges and opportunities of Big Data provenance related to the veracity of the datasets themselves and to the provenance of the analytical processes that analyze these datasets. It also describes our current efforts towards tracking and utilizing Big Data provenance, using workflows as a programming model to analyze Big Data.
