...
首页> 外文期刊>Journal of Molecular Biology >Estimating the number of protein folds and families from complete genome data.
【24h】

Estimating the number of protein folds and families from complete genome data.

机译:从完整的基因组数据估算蛋白质折叠和家族的数量。

获取原文
获取原文并翻译 | 示例

摘要

Using the data on proteins encoded in complete genomes, combined with a rigorous theory of the sampling process, we estimate the total number of protein folds and families, as well as the number of folds and families in each genome. The total number of folds in globular, water- soluble proteins is estimated at about 1000, with structural information currently available for about one-third of the number. The sequenced genomes of unicellular organisms encode from approximately 25%, for the minimal genomes of the Mycoplasmas, to 70-80% for larger genomes, such as Escherichia coli and yeast, of the total number of folds. The number of protein families with significant sequence conservation was estimated to be between 4000 and 7000, with structures available for about 20% of these. Copyright 2000 Academic Press.
机译:利用完整基因组中编码的蛋白质数据,结合严格的采样过程理论,我们可以估算蛋白质折叠和家族的总数,以及每个基因组中折叠和家族的数量。球形水溶性蛋白质的折叠总数估计为约1000,目前可获得的结构信息约占总数的三分之一。单细胞生物的测序基因组的总折叠数从大约25%(对于支原体的最小基因组)到70-80%(对于较大的基因组,如大肠杆菌和酵母)。具有显着的序列保守性的蛋白质家族的数量估计在4000至7000之间,其中约有20%的结构可用。版权所有2000学术出版社。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号