A rapid classification protocol for the CATH Domain Database to support structural genomics.

Pearl FM; Martin N; Bray JE; Buchan DW; Harrison AP; Lee D; Reeves GA; Shepherd AJ; Sillitoe I; Todd AE; Thornton JM; Orengo CA


Abstract

In order to support the structural genomic initiatives, both by rapidly classifying newly determined structures and by suggesting suitable targets for structure determination, we have recently developed several new protocols for classifying structures in the CATH domain database (http://www.biochem.ucl.ac.uk/bsm/cath). These aim to increase the speed of classification of new structures using fast algorithms for structure comparison (GRATH) and to improve the sensitivity in recognising distant structural relatives by incorporating sequence information from relatives in the genomes (DomainFinder). In order to ensure the integrity of the database given the expected increase in data, the CATH Protein Family Database (CATH-PFDB), which currently includes 25,320 structural domains and a further 160,000 sequence relatives, has now been installed in a relational ORACLE database. This was essential for developing more rigorous validation procedures and for allowing efficient querying of the database, particularly for genome analysis. The associated Dictionary of Homologous Superfamilies [Bray,J.E., Todd,A.E., Pearl,F.M.G., Thornton,J.M. and Orengo,C.A. (2000) Protein Eng., 13, 153-165], which provides multiple structural alignments and functional information to assist in assigning new relatives, has also been expanded recently and now includes information for 903 homologous superfamilies. In order to improve coverage of known structures, preliminary classification levels are now provided for new structures at interim stages in the classification protocol. Since a large proportion of new structures can be rapidly classified using profile-based sequence analysis [e.g. PSI-BLAST: Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Nucleic Acids Res., 25, 3389-3402], this provides preliminary classification for easily recognisable homologues, which in the latest release of CATH (version 1.7) represented nearly three-quarters of the non-identical structures.
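
The idea of assigning a preliminary classification at interim stages can be illustrated with a minimal sketch. This is not the CATH implementation: the abstract does not give the protocol's exact stages or their order, so the staging below (a fast sequence-profile stage followed by a slower structure-comparison stage), the function `classify`, the `Assignment` record, the domain identifiers and the superfamily labels are all hypothetical. Each stage is modelled as a lookup that either returns a superfamily label or `None`.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple

@dataclass
class Assignment:
    domain_id: str
    superfamily: Optional[str]  # None when no stage recognised a relative
    stage: str                  # stage that made the call, or "unclassified"

# A stage maps a domain identifier to a superfamily label, or None on no hit.
Stage = Callable[[str], Optional[str]]

def classify(domain_id: str, stages: Sequence[Tuple[str, Stage]]) -> Assignment:
    """Try each stage in order and stop at the first one that returns a hit."""
    for name, stage in stages:
        hit = stage(domain_id)
        if hit is not None:
            return Assignment(domain_id, hit, name)
    return Assignment(domain_id, None, "unclassified")

# Toy stand-ins for real methods (hypothetical data, not CATH results).
sequence_hits = {"dom1": "SF-A"}    # e.g. found by profile-based sequence search
structure_hits = {"dom2": "SF-B"}   # e.g. found only by structure comparison

stages = [
    ("sequence_profile", sequence_hits.get),
    ("structure_comparison", structure_hits.get),
]

results = [classify(d, stages) for d in ("dom1", "dom2", "dom3")]
for r in results:
    print(r.domain_id, r.superfamily, r.stage)
```

Recording the stage alongside the label is one way to mark an assignment as preliminary: a domain placed by the sequence stage can be reported immediately and revisited later by slower methods.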


Bibliographic details

Source: Nucleic Acids Research, 2001, Issue 1, 5 pages
Authors: Pearl FM; Martin N; Bray JE; Buchan DW; Harrison AP; Lee D; Reeves GA; Shepherd AJ; Sillitoe I; Todd AE; Thornton JM; Orengo CA
Original format: PDF
Language: English
Chinese Library Classification: Cell morphology; Biochemistry
Keywords: Databases; Factual; Proteins

