Computational inference of protein structure and function from microbial genomes and metagenomes.

Abstract

DNA sequences derived from genomes and metagenomes encode a wealth of information about protein structure and function. However, because of the large number of available sequences, computational and statistical methods are necessary to infer biological meaning. Here, three approaches are explored that infer protein structure or function from microbial genomes and metagenomes. First, the host-pathogen interaction between human macrophages and Mycobacterium leprae is investigated. By comparing human functional lipase domains upregulated in lepromatous lesions with the genomic repertoires of several mycobacteria, we find that host proteins may complement lipid-associated metabolic deficiencies of M. leprae. Second, function is inferred for protein families in an ocean metagenome by identifying conserved genomic neighbors with known functions. This approach correctly infers function for many well-annotated proteins and suggests high-confidence functions for several large novel protein families. Further scrutiny of the genomic neighbors reveals that many of the novel families are phage proteins, and many other phage protein families are of bacterial origin. Finally, the information contained in large protein families derived from genome and metagenome sequences is exploited to infer residue pairs that are in contact in the three-dimensional structures of proteins. We integrate multiple lines of evidence via a Bayesian inference procedure to produce a posterior probability of contact for all residue pairs in a protein. We use these probabilistic predicted contacts to evaluate predicted 3D protein models, and find that the models that best satisfy the predicted contacts are those most similar to the correct protein structures.
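
The third approach can be illustrated with a minimal sketch. The abstract does not specify the thesis's actual Bayesian procedure, so the code below assumes a naive-Bayes combination in which each line of evidence contributes an independent likelihood ratio, and it scores a 3D model by the fraction of posterior probability mass whose residue pairs lie within a distance cutoff. The function names, the 8 Å cutoff, and the toy numbers are all hypothetical and chosen only for illustration.

```python
import math


def posterior_contact_probability(prior, likelihood_ratios):
    """Combine evidence into P(contact | evidence) using Bayes' rule in odds form.

    Assumption (not from the thesis): each line of evidence is conditionally
    independent, so posterior odds = prior odds * product of likelihood ratios.
    """
    log_odds = math.log(prior / (1.0 - prior))
    for lr in likelihood_ratios:
        log_odds += math.log(lr)
    return 1.0 / (1.0 + math.exp(-log_odds))


def contact_satisfaction_score(posteriors, model_coords, cutoff=8.0):
    """Fraction of posterior probability mass satisfied by a 3D model.

    posteriors:   {(i, j): posterior probability that residues i and j touch}
    model_coords: {residue index: (x, y, z)} for one representative atom
    cutoff:       hypothetical contact distance threshold in angstroms
    """
    total = sum(posteriors.values())
    if total == 0:
        return 0.0
    satisfied = 0.0
    for (i, j), p in posteriors.items():
        if math.dist(model_coords[i], model_coords[j]) <= cutoff:
            satisfied += p
    return satisfied / total


# Toy example: two residue pairs, one close together in the model, one far apart.
toy_posteriors = {(0, 1): 0.9, (0, 2): 0.1}
toy_coords = {0: (0.0, 0.0, 0.0), 1: (3.0, 4.0, 0.0), 2: (20.0, 0.0, 0.0)}
toy_score = contact_satisfaction_score(toy_posteriors, toy_coords)
```

Under these assumptions, a model that places high-probability pairs close together receives a higher score, which mirrors the abstract's observation that models satisfying the predicted contacts tend to resemble the correct structure.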

Bibliographic record

  • Author

    Miller, Christopher Scott

  • Author affiliation

    University of California, Los Angeles

  • Degree-granting institution: University of California, Los Angeles
  • Subject: Biology, Bioinformatics
  • Degree: Ph.D.
  • Year: 2008
  • Pages: 164 p.
  • Total pages: 164
  • Original format: PDF
  • Language of text: English
  • Date added to database: 2022-08-17 11:38:47
