Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
展开▼
机译:真核基因外显子-内含子结构的进化一直是一个长期而激烈的争论。内含子的概念很早,后来更名为“内含子优先”,认为蛋白质编码基因甚至在生命进化的最早阶段就被众多内含子打断,并且内含子通过促进编码蛋白质的序列重组在蛋白质起源中起着重要作用。小蛋白质/肽模块。内含子-晚期概念认为,内含子仅在真核生物中出现,而新的内含子在整个真核生物进化过程中一直在不断积累。对完全测序的真核基因组直系同源基因的分析显示,动植物直系同源基因中甚至在动物,植物和原生生物之间的同源内含子位置都有许多共享的内含子位置,这表明自上次真核生物祖先(LECA)以来,许多祖先内含子一直存在。使用越来越多的真核生物的基因组集合和日益先进的概率模型重建内含子得失,令人信服地表明,LECA和每个真核超族的祖先都有内含子丰富的基因,其内含子密度可与最内含子丰富的人相媲美。现代基因组,例如脊椎动物的基因组。大多数真核生物谱系的后续进化主要涉及内含子的丧失,只有少数内含子获得实质性收获的事件可能伴随着重大的进化创新,例如后生动物的起源。自剪接的II族内含子最初可能是由线粒体内共生体入侵而进入真核生物的基因组,可能是真核生物发生的关键因素,特别是触发了内膜和细胞核的起源。相反,剪接错误导致了选择性剪接,这是多细胞真核生物生物学复杂性的主要贡献。没有迹象表明,除了相对罕见的移动自剪接内含子以外,任何原核生物都没有在蛋白质编码基因中拥有剪接体或内含子。因此,没有任何证据支持内含子优先的情况,但是蛋白质编码基因的外显子-内含子结构似乎已经与真核细胞一起进化,并且内含子是整个真核生物历史上进化的主要因素。这篇文章由I. King Jordan,Manuel Irimia(由Anthony Poole提名),Tobias Mourier(由Anthony Poole提名)和Fyodor Kondrashov进行了评论。有关完整的报告,请参阅“审阅者的报告”部分。
展开▼