首页> 外文OA文献 >UniProt in RDF: Tackling Data Integration and Distributed Annotation with the Semantic Web
【2h】

UniProt in RDF: Tackling Data Integration and Distributed Annotation with the Semantic Web

机译:RDF中的UniProt:利用语义网处理数据集成和分布式注释

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The UniProt knowledgebase (UniProtKB) is a comprehensive repository of protein sequence and annotation data. We collect information from the scientific literature and other databases and provide links to over one hundred biological resources. Such links between different databases are an important basis for data integration, but the lack of a common standard to represent and link information makes data integration an expensive business. At UniProt we have started to tackle this problem by using the Resource Description Framework ("http://www.w3.org/RDF/":http://www.w3.org/RDF/) to represent our data. RDF is a core technology for the World Wide Web Consortium's Semantic Web activities ("http://www.w3.org/2001/sw/":http://www.w3.org/2001/sw/) and is therefore well suited to work in a distributed and decentralized environment. The RDF data model represents arbitrary information as a set of simple statements of the form subject-predicate-object. To enable the linking of data on the Web, RDF requires that each resource must have a (globally) unique identifier. These identifiers allow everybody to make statements about a given resource and, together with the simple structure of the RDF data model, make it easy to combine the statements made by different people (or databases) to allow queries across different datasets. RDF is thus an industry standard that can make a major contribution to solve two important problems of bioinformatics: distributed annotation and data integration.
机译:UniProt知识库(UniProtKB)是蛋白质序列和注释数据的综合存储库。我们从科学文献和其他数据库中收集信息,并提供指向一百多种生物资源的链接。不同数据库之间的此类链接是数据集成的重要基础,但是缺乏表示和链接信息的通用标准,使得数据集成成为一项昂贵的业务。在UniProt,我们已经开始通过使用资源描述框架(“ http://www.w3.org/RDF/":http://www.w3.org/RDF/)来表示我们的数据来解决这个问题。 RDF是万维网联盟的语义Web活动(“ http://www.w3.org/2001/sw/":http://www.w3.org/2001/sw/)的核心技术,因此非常适合在分散和分散的环境中工作。 RDF数据模型将任意信息表示为一组格式为subject-predicate-object的简单语句。为了启用Web上的数据链接,RDF要求每个资源必须具有(全局)唯一标识符。这些标识符使每个人都可以针对给定资源发表声明,并且与RDF数据模型的简单结构一起,可以轻松地组合由不同人员(或数据库)做出的声明,以允许跨不同数据集进行查询。因此,RDF是一项行业标准,可以为解决生物信息学的两个重要问题做出重大贡献:分布式注释和数据集成。

著录项

  • 作者单位
  • 年度 2009
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号