首页> 外文会议>International work-conference on bioinformatics and biomedical engineering >Associating Gene Ontology Terms with Pfam Protein Domains
【24h】

Associating Gene Ontology Terms with Pfam Protein Domains

机译:将基因本体术语与Pfam蛋白域相关联

获取原文

摘要

With the growing number of three-dimensional protein structures in the protein data bank (PDB), there is a need to annotate these structures at the domain level in order to relate protein structure to protein function. Thanks to the SIFTS database, many PDB chains are now cross-referenced with Pfam domains and Gene ontology (GO) terms. However, these annotations do not include any explicit relationship between individual Pfam domains and GO terms. Therefore, creating a direct mapping between GO terms and Pfam domains will provide a new and more detailed level of protein structure annotation. This article presents a novel content-based filtering method called GODM that can automatically infer associations between GO terms and Pfam domains directly from existing GO-chain/Pfam-chain associations from the SIFTS database and GO-sequence/Pfam-sequence associations from the UniProt databases. Overall, GODM finds a total of 20,318 non-redundant GO-Pfam associations with a F-measure of 0.98 with respect to the InterPro database, which is treated here as a "Gold Standard". These associations could be used to annotate thousands of PDB chains or protein sequences for which their domain composition is known but which currently lack any GO annotation. The GODM database is publicly available at http://godm.loria.fr/.
机译:随着蛋白质数据库(PDB)中三维蛋白质结构数量的增加,有必要在域级别注释这些结构,以使蛋白质结构与蛋白质功能相关联。多亏了SIFTS数据库,现在许多PDB链已与Pfam域和基因本体论(GO)术语交叉引用。但是,这些注释不包括单个Pfam域和GO术语之间的任何显式关系。因此,在GO词和Pfam域之间创建直接映射将提供新的和更详细的蛋白质结构注释级别。本文介绍了一种新颖的基于内容的过滤方法,称为GODM,它可以直接从SIFTS数据库中的现有GO链/ Pfam-链关联中自动推断出GO术语与Pfam域之间的关联,并从UniProt中推断出GO-sequence / Pfam-sequence关联数据库。总体而言,对于InterPro数据库,GODM发现总共有20,318个非冗余GO-Pfam关联,其F度量为0.98,在这里被称为“黄金标准”。这些关联可用于注释成千上万个PDB链或蛋白质序列,这些PDB链或蛋白质序列的域组成已知,但目前缺少任何GO注释。 GODM数据库可从http://godm.loria.fr/公开获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号