...
首页> 外文期刊>Nucleic Acids Research >Connected gene neighborhoods in prokaryotic genomes
【24h】

Connected gene neighborhoods in prokaryotic genomes

机译:原核基因组中的连通基因邻域

获取原文
获取原文并翻译 | 示例
           

摘要

A computational method was developed for delineating connected gene neighborhoods in bacterial and archaeal genomes. These gene neighborhoods are not typically present, in their entirety, in any single genome, but are held together by overlapping, partially conserved gene arrays. The procedure was applied to comparing the orders of orthologous genes, which were extracted from the database of Clusters of orthologous Groups of proteins (COGs), in 31 prokaryotic genomes and resulted in the identification of 188 clusters of gene arrays, which included 1001 of 2890 COGs. These clusters were projected onto actual genomes to produce extended neighborhoods including additional genes, which are adjacent to the genes from the clusters and are transcribed in the same direction, which resulted in a total of 2387 COGs being included in the neighborhoods. Most of the neighborhoods consist predominantly of genes united by a coherent functional theme, but also include a minority of genes without an obvious functional connection to the main theme. We hypothesize that although some of the latter genes might have unsuspected roles, others are maintained within gene arrays because of the advantage of expression at a level that is typical of the given neighborhood. We designate this phenomenon 'genomic hitchhiking'. The largest neighborhood includes 79 genes (COGs) and consists of overlapping, rearranged ribosomal protein superoperons; apparent genome hitchhiking is particularly typical of this neighborhood and other neighborhoods that consist of genes coding for translation machinery components. Several neighborhoods involve previously undetected connections between genes, allowing new functional predictions. Gene neighborhoods appear to evolve via complex rearrangement, with different combinations of genes from a neighborhood fixed in different lineages.
机译:开发了一种计算方法来描绘细菌和古细菌基因组中的连接基因邻域。这些基因邻域通常不完整地存在于任何单个基因组中,而是通过重叠,部分保守的基因阵列保持在一起。该程序用于比较直系同源基因的顺序,直系同源基因是从31个原核生物基因组的直系同源蛋白质簇(COG)的数据库中提取的,并鉴定了188个基因阵列簇,其中包括2890个中的1001个COG。将这些簇投射到实际的基因组上,以产生扩展的邻域,包括其他基因,这些邻域与簇中的基因相邻并以相同方向转录,导致邻域中总共包含2387个COG。大多数邻域主要由连贯的功能主题组成的基因组成,但也包括少数与主要主题没有明显功能联系的基因。我们假设,尽管后者的某些基因可能具有意想不到的作用,但由于在特定邻域中典型表达水平的优势,其他基因仍保留在基因阵列中。我们将这种现象称为“基因组搭便车”。最大的邻域包含79个基因(COG),由重叠的,重排的核糖体蛋白超操纵子组成。明显的基因组搭便车在该邻域和其他由编码翻译机器组件的基因组成的邻域中尤为典型。多个邻域涉及基因之间以前未发现的连接,从而可以进行新的功能预测。基因邻域似乎是通过复杂的重排而进化的,来自不同邻域的基因的不同组合固定在不同的世系中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号