Journal: Journal of Internet Services and Applications

A data infrastructure reference model with applications: towards realization of a ScienceTube vision with a data replication service


Abstract

Scientific user communities have worked with data for many years and thus already have a wide variety of data infrastructures in production today. The aim of this paper is therefore not to create one new general data architecture that individual user communities would fail to adopt. Instead, this contribution aims to design a reference model with abstract entities that can federate existing concrete infrastructures under one umbrella. A reference model is an abstract framework for understanding significant entities and the relationships between them, and thus helps to understand existing data infrastructures when comparing them in terms of functionality, services, and boundary conditions. An architecture derived from such a reference model can then be used to create a federated architecture that builds on existing infrastructures that could align with a major common vision. In this contribution, this common vision is named 'ScienceTube'; it sets the high-level goal that the reference model aims to support. This paper describes how a well-focused use case around data replication and its related activities in the EUDAT project aims to provide a first step towards this vision. Concrete stakeholder requirements from scientific end users, such as those of the European Strategy Forum on Research Infrastructures (ESFRI) projects, underpin this contribution with clear evidence that the EUDAT activities are bottom-up and thus provide real solutions to the 'high-level big data challenges' that are so often only described. The federated approach followed here, which takes advantage of community centers and data centers (with large computational resources), further shows how data replication services enable data-intensive computing on terabytes or even petabytes of data emerging from ESFRI projects.
