Journal: Bioinformatics

Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation

Abstract

Motivation: Many bioinformatics data resources hold data not only in the form of sequences, but also as annotation. In the majority of cases, annotation is written as scientific natural language: this is suitable for humans, but not particularly useful for machine processing. Ontologies offer a mechanism by which knowledge can be represented in a form capable of such processing. In this paper we investigate the use of ontological annotation to measure the similarities in knowledge content, or 'semantic similarity', between entries in a data resource. These measures allow a bioinformatician to perform a similarity comparison over annotation in a manner analogous to those performed over sequences. A measure of semantic similarity for the knowledge component of bioinformatics resources should afford a biologist a new tool in their repertoire of analyses.

Results: We present the results from experiments that investigate the validity of using semantic similarity by comparison with sequence similarity. We show a simple extension that enables a semantic search of the knowledge held within sequence databases.