首页> 外文期刊>Information systems frontiers >Quarry: A User-centered Big Data Integration Platform
【24h】

Quarry: A User-centered Big Data Integration Platform

机译:采石场:以用户为中心的大数据集成平台

获取原文
获取原文并翻译 | 示例
       

摘要

Obtaining valuable insights and actionable knowledge from data requires cross-analysis of domain data typically coming from various sources. Doing so, inevitably imposes burdensome processes of unifying different data formats, discovering integration paths, and all this given specific analytical needs of a data analyst. Along with large volumes of data, the variety of formats, data models, and semantics drastically contribute to the complexity of such processes. Although there have been many attempts to automate various processes along the Big Data pipeline, no unified platforms accessible by users without technical skills (like statisticians or business analysts) have been proposed. In this paper, we present a Big Data integration platform (Quarry) that uses hypergraph-based metadata to facilitate (and largely automate) the integration of domain data coming from a variety of sources, and provides an intuitive interface to assist end users both in: (1) data exploration with the goal of discovering potentially relevant analysis facets, and (2) consolidation and deployment of data flows which integrate the data, and prepare them for further analysis (descriptive or predictive), visualization, and/or publishing. We validate Quarry's functionalities with the use case of World Health Organization (WHO) epidemiologists and data analysts in their fight against Neglected Tropical Diseases (NTDs).
机译:从数据获得有价值的见解和可操作知识需要跨分析通常来自各种来源的域数据。这样做,不可避免地强加了统一不同数据格式的繁重过程,发现集成路径,以及所有这些数据分析师的特定分析需求。随着大量的数据,各种格式,数据模型和语义均可促进这些过程的复杂性。虽然已经有许多尝试沿着大数据管道自动化各种过程,但没有提出没有技术技能(如统计学家或商业分析师)的用户访问的统一平台。在本文中,我们提供了一个大数据集成平台(采石场),它使用基于超图的元数据来促进(并且在很大程度上自动化)来自各种来源的域数据的集成,并提供直观的界面来帮助最终用户:(1)数据探索,目的是发现潜在的相关分析面,(2)整合和部署数据流量,这些数据流集成了数据,并准备了进一步分析(描述性或预测),可视化和/或发布。我们通过世界卫生组织(WHO)流行病学家和数据分析师对抗被忽视的热带疾病(NTDS)的使用情况来验证采石场的功能。

著录项

  • 来源
    《Information systems frontiers》 |2021年第1期|9-33|共25页
  • 作者单位

    Universitat Politecnica de Catalunya (BarcelonaTech) Campus Nord Omega-125 UPC - dept ESSI C/Jordi Girona 1-3 E-08034 Barcelona Spain;

    Universitat Politecnica de Catalunya (BarcelonaTech) Campus Nord Omega-125 UPC - dept ESSI C/Jordi Girona 1-3 E-08034 Barcelona Spain;

    Universitat Politecnica de Catalunya (BarcelonaTech) Campus Nord Omega-125 UPC - dept ESSI C/Jordi Girona 1-3 E-08034 Barcelona Spain;

    Universitat Politecnica de Catalunya (BarcelonaTech) Campus Nord Omega-125 UPC - dept ESSI C/Jordi Girona 1-3 E-08034 Barcelona Spain;

    Universitat Politecnica de Catalunya (BarcelonaTech) Campus Nord Omega-125 UPC - dept ESSI C/Jordi Girona 1-3 E-08034 Barcelona Spain;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Data Integration; Big Data; Data-Intensive Flows; Metadata;

    机译:数据集成;大数据;数据密集型流动;元数据;
  • 入库时间 2022-08-18 21:05:49

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号