首页> 美国卫生研究院文献>other >Modeling the Evolution of Protein Domain Architectures Using Maximum Parsimony
【2h】

Modeling the Evolution of Protein Domain Architectures Using Maximum Parsimony

机译:使用最大简约性对蛋白质结构域架构的演化进行建模

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Domains are basic evolutionary units of proteins and most proteins have more than one domain. Advances in domain modeling and collection are making it possible to annotate a large fraction of known protein sequences by a linear ordering of their domains, yielding their architecture. Protein domain architectures link evolutionarily related proteins and underscore their shared functions. Here, we attempt to better understand this association by identifying the evolutionary pathways by which extant architectures may have evolved. We propose a model of evolution in which architectures arise through rearrangements of inferred precursor architectures and acquisition of new domains. These pathways are ranked using a parsimony principle, whereby scenarios requiring the fewest number of independent recombination events, namely fission and fusion operations, are assumed to be more likely. Using a data set of domain architectures present in 159 proteomes that represent all three major branches of the tree of life allows us to estimate the history of over 85% of all architectures in the sequence database. We find that the distribution of rearrangement classes is robust with respect to alternative parsimony rules for inferring the presence of precursor architectures in ancestral species. Analyzing the most parsimonious pathways, we find 87% of architectures to gain complexity over time through simple changes, among which fusion events account for 5.6 times as many architectures as fission. Our results may be used to compute domain architecture similarities, for example, based on the number of historical recombination events separating them. Domain architecture “neighbors” identified in this way may lead to new insights about the evolution of protein function.
机译:结构域是蛋白质的基本进化单位,大多数蛋白质具有多个结构域。域建模和收集的进步使得通过对域进行线性排序来注释大部分已知蛋白序列成为可能,从而产生了其结构。蛋白质结构域结构链接了进化相关的蛋白质,并强调了它们的共有功能。在这里,我们试图通过确定现有体系结构可能已经进化的进化途径来更好地理解这种关联。我们提出了一种演化模型,在该模型中,通过推断的前驱体系结构的重新排列和新域的获取而产生体系结构。使用简约原则对这些途径进行排序,从而假定需要最少数量的独立重组事件(即裂变和聚变操作)的方案更有可能。使用代表生命树的所有三个主要分支的159个蛋白质组中存在的领域架构数据集,可以估算序列数据库中所有架构的85%以上的历史。我们发现,重排类别的分布相对于用于推断祖先物种中前体结构的替代简约规则而言是健壮的。通过分析最简约的途径,我们发现87%的架构通过简单的更改随着时间的推移变得越来越复杂,其中融合事件占裂变架构的5.6倍。我们的结果可以用于计算领域架构的相似性,例如,基于将它们分开的历史重组事件的数量。以这种方式确定的域架构“邻居”可能会导致有关蛋白质功能进化的新见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号