首页> 外文期刊>PeerJ Computer Science >Interoperability and FAIRness through a novel combination of Web technologies
【24h】

Interoperability and FAIRness through a novel combination of Web technologies

机译:通过Web技术的新颖组合实现互操作性和公平性

获取原文
       

摘要

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT). These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not. The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability. Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings. We show that by using off-the-shelf technologies, interoperability can be achieved atthe level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles. The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs.
机译:生命科学中的数据种类繁多,并且存储在广泛的存储库中,从针对特定数据类型设计的存储库(例如,用于途径数据的KEGG或用于蛋白质数据的UniProt)到通用的存储库(例如FigShare, Zenodo,Dataverse或EUDAT)。这些数据在敏感性和安全性方面有很大不同的级别。例如,有关患者遗传突变的临床观察高度敏感,而物种多样性的观察通常不敏感。从一个存储库到另一个存储库的数据模型缺乏统一性,以及元数据描述的丰富性和可用性不足,使得这些数据的集成和分析成为一项手动,耗时的任务,并且没有可伸缩性。在这里,我们探索了一组面向资源的Web设计模式,用于数据发现,可访问性,转换和集成,这些模式可以由任何通用或专用存储库实现,以作为一种手段来帮助用户查找和重用其数据存储。我们表明,通过使用现有技术,可以在单个电子表格单元格级别实现互操作性。我们注意到,该体系结构的行为与FAIR数据原则所定义的愿望相比具有优势,因此可以代表这些原则的示例实现。提议的互操作性设计模式可以用于改进新数据和旧数据的发现和集成,从而最大限度地利用所有学术成果。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号