...
首页> 外文期刊>BMC Evolutionary Biology >On homology searches by protein Blast and the characterization of the age of genes
【24h】

On homology searches by protein Blast and the characterization of the age of genes

机译:关于蛋白质Blast的同源性搜索和基因年龄的表征

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background It has been shown in a variety of organisms, including mammals, that genes that appeared recently in evolution, for example orphan genes, evolve faster than older genes. Low functional constraints at the time of origin of novel genes may explain these results. However, this observation has been recently attributed to an artifact caused by the inability of Blast to detect the fastest genes in different eukaryotic genomes. Distinguishing between these two possible explanations would be of great importance for any studies dealing with the taxon distribution of proteins and the origin of novel genes. Results Here we used simulations of protein sequences to examine the capacity of Blast to detect proteins of diverse evolutionary rates in the different species of an eukaryotic phylogenetic tree that included metazoans, fungi and plants. We simulated the evolution of protein genes with the same evolutionary rates than those observed in functional mammalian genes and with among-site rate heterogeneity. Under these conditions, we found that only a very small percentage of simulated ancestral eukaryotic proteins was affected by the Blast artifact. We show that the good detectability of Blast is due to the heterogeneity of protein evolutionary rates at different sites, since only a small conserved motif in a sequence suffices to detect its homologues. Our results indicate that Blast, at least when applied within eukaryotes, only misses homologues of extremely fast-evolving sequences, which are rare in the mammalian genome, as well as sequences evolving homogeneously or pseudogenes. Conclusion Although great care should be exercised in the recognition of remote homologues, most functional mammalian genes can be detected in eukaryotic genomes by Blast. That is, the majority of functional mammalian genes are not as fast as for not being detected in other metazoans, fungi or plants, if they had been present in these organisms. Thus, the correlation previously found between age and rate seems not to be due to a pure Blast artifact, at least for mammals. This may have important implications to understand the mechanisms by which novel genes originate.
机译:背景技术已经在包括哺乳动物在内的多种生物中表明,最近进化的基因,例如孤儿基因,比旧的基因进化得更快。新基因起源时功能限制低可能解释了这些结果。然而,该观察结果最近归因于由Blast无法检测不同真核基因组中最快的基因引起的伪像。区分这两种可能的解释对于任何有关蛋白质分类群分布和新基因起源的研究都将非常重要。结果在这里,我们使用蛋白质序列的模拟来检查Blast检测真核系统树的不同物种(包括后生动物,真菌和植物)中不同进化速率蛋白质的能力。我们以与在功能哺乳动物基因中观察到的相同的进化速率和位点间异质性模拟了蛋白质基因的进化速率。在这些条件下,我们发现,只有极少数的模拟祖先真核蛋白质受Blast伪影影响。我们显示,Blast的良好可检测性是由于蛋白质进化率在不同位点的异质性所致,因为序列中只有一个小的保守基序足以检测其同源物。我们的研究结果表明,至少在真核生物中使用Blast时,只会错过在哺乳动物基因组中罕见的极快进化序列的同系物,以及同源进化或假基因进化的序列。结论尽管必须谨慎地识别远处的同源物,但Blast可以在真核基因组中检测到大多数功能性哺乳动物基因。也就是说,大多数功能性哺乳动物基因的速度不如在其他后生动物,真菌或植物中检测到的速度快,如果它们已存在于这些生物中。因此,至少在哺乳动物中,先前发现的年龄与速率之间的相关性似乎不是由于纯的Blast人工制品引起的。这对于理解新基因起源的机制可能具有重要意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号