...
首页> 外文期刊>Frontiers in Microbiology >Phylogenetic Tracings of Proteome Size Support the Gradual Accretion of Protein Structural Domains and the Early Origin of Viruses from Primordial Cells
【24h】

Phylogenetic Tracings of Proteome Size Support the Gradual Accretion of Protein Structural Domains and the Early Origin of Viruses from Primordial Cells

机译:蛋白质组大小的系统发生示踪支持蛋白质结构域的逐渐积累和病毒从原始细胞的早期起源

获取原文

摘要

Untangling the origin and evolution of viruses remains a challenging proposition. We recently studied the global distribution of protein domain structures in thousands of completely sequenced viral and cellular proteomes with comparative genomics, phylogenomics, and multidimensional scaling methods. A tree of life describing the evolution of proteomes revealed viruses emerging from the base of the tree as a fourth supergroup of life. A tree of domains indicated an early origin of modern viral lineages from ancient cells that co-existed with the cellular ancestors. However, it was recently argued that the rooting of our trees and the basal placement of viruses was artifactually induced by small genome (proteome) size. Here we show that these claims arise from misunderstanding and misinterpretations of cladistic methodology. Trees are reconstructed unrooted, and thus, their topologies cannot be distorted a posteriori by the rooting methodology. Tracing proteome size in trees and multidimensional views of evolutionary relationships as well as tests of leaf stability and exclusion/inclusion of taxa demonstrated that the smallest proteomes were neither attracted toward the root nor caused any topological distortions of the trees. Simulations confirmed that taxa clustering patterns were independent of proteome size and were determined by the presence of known evolutionary relatives in data matrices, highlighting the need for broader taxon sampling in phylogeny reconstruction. Instead, phylogenetic tracings of proteome size revealed a slowdown in innovation of the structural domain vocabulary and four regimes of allometric scaling that reflected a Heaps law. These regimes explained increasing economies of scale in the evolutionary growth and accretion of kernel proteome repertoires of viruses and cellular organisms that resemble growth of human languages with limited vocabulary sizes. Results reconcile dynamic and static views of domain frequency distributions that are consistent with the axiom of spatiotemporal continuity that is tenet of evolutionary thinking.
机译:弄清病毒的起源和进化仍然是一个具有挑战性的主张。我们最近通过比较基因组学,系统基因组学和多维缩放方法研究了数千种完全测序的病毒和细胞蛋白质组中蛋白质结构域结构的全球分布。一棵描述蛋白质组进化的生命树揭示了从树的根部出现的病毒是生命的第四超群。域树表明,现代病毒谱系起源于与细胞祖先共存的古代细胞。但是,最近有人争论说,树木的生根和病毒的基础位置是由小的基因组(蛋白质组)大小人为地诱导的。在这里,我们表明,这些主张源于对方法论方法的误解和误解。树木是无根地重建的,因此,通过生根方法无法使它们的拓扑结构变形。跟踪树木中的蛋白质组大小和进化关系的多维视图以及叶稳定性和分类单元的排除/包含测试表明,最小的蛋白质组既不吸引根部也不引起树木的拓扑变形。模拟证实,分类单元的聚类模式与蛋白质组大小无关,并且由数据矩阵中已知进化亲戚的存在来确定,这突显出在系统发育重建中需要更广泛的分类单元采样。取而代之的是,蛋白质组大小的系统发育示踪揭示了结构域词汇和四种异速变比缩放机制(反映Heaps定律)的创新放缓。这些机制解释了病毒和细胞有机体的进化蛋白质组库的进化增长和积累中规模经济的增长,这类似于人类语言的发展,词汇量有限。结果调和了域频率分布的动态和静态视图,这些视图与时空连续性公理一致,后者是进化思想的宗旨。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号