首页> 美国卫生研究院文献>Nucleic Acids Research >The Protein Information Resource: an integrated public resource of functional annotation of proteins
【2h】

The Protein Information Resource: an integrated public resource of functional annotation of proteins

机译:蛋白质信息资源:蛋白质功能注释的综合公共资源

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. The PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve protein annotation and the coverage of experimentally validated data, a bibliography submission system is developed for scientists to submit, categorize and retrieve literature information. Comprehensive protein information is available from iProClass, which includes family classification at the superfamily, domain and motif levels, structural and functional features of proteins, as well as cross-references to over 40 biological databases. To provide timely and comprehensive protein data with source attribution, we have introduced a non-redundant reference protein database, PIR-NREF. The database consists of about 800 000 proteins collected from PIR-PSD, SWISS-PROT, TrEMBL, GenPept, RefSeq and PDB, with composite protein names and literature data. To promote database interoperability, we provide XML data distribution and open database schema, and adopt common ontologies. The PIR web site (http://pir.georgetown.edu/) features data mining and sequence analysis tools for information retrieval and functional identification of proteins based on both sequence and annotation information. The PIR databases and other files are also available by FTP (ftp://nbrfa.georgetown.edu/pir_databases).
机译:蛋白质信息资源(PIR)可作为蛋白质数据功能注释的集成公共资源,以支持基因组/蛋白质组学研究和科学发现。 PIR与慕尼黑蛋白质序列信息中心(MIPS)和日本国际蛋白质信息数据库(JIPID)合作,产生了PIR-国际蛋白质序列数据库(PSD),这是公共领域中主要的带注释的蛋白质序列数据库,包含约25万种蛋白质。为了改善蛋白质注释和实验验证数据的覆盖范围,开发了一种书目提交系统,供科学家提交,分类和检索文献信息。 iProClass提供了全面的蛋白质信息,包括超家族的家族分类,域和基序水平,蛋白质的结构和功能特征,以及对40多个生物学数据库的交叉引用。为了提供具有来源归属的及时而全面的蛋白质数据,我们引入了一个非冗余参考蛋白质数据库PIR-NREF。该数据库包括从PIR-PSD,SWISS-PROT,TrEMBL,GenPept,RefSeq和PDB收集的约80万种蛋白质,并带有复合蛋白质名称和文献数据。为了提高数据库的互操作性,我们提供XML数据分发和开放的数据库架构,并采用常见的本体。 PIR网站(http://pir.georgetown.edu/)提供了数据挖掘和序列分析工具,用于基于序列和注释信息进行信息检索和蛋白质功能识别。 PIR数据库和其他文件也可以通过FTP(ftp:/brfa.georgetown.edu/pir_databases)获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号