首页> 美国卫生研究院文献>DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes >Re-Annotation of Protein-Coding Genes in 10 Complete Genomes of Neisseriaceae Family by Combining Similarity-Based and Composition-Based Methods
【2h】

Re-Annotation of Protein-Coding Genes in 10 Complete Genomes of Neisseriaceae Family by Combining Similarity-Based and Composition-Based Methods

机译:结合基于相似度和基于成分的方法对10个奈瑟菌科完整基因组中蛋白质编码基因的重新注释

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper, we performed a comprehensive re-annotation of protein-coding genes by a systematic method combining composition- and similarity-based approaches in 10 complete bacterial genomes of the family Neisseriaceae. First, 418 hypothetical genes were predicted as non-coding using the composition-based method and 413 were eliminated from the gene list. Both the scatter plot and cluster of orthologous groups (COG) fraction analyses supported the result. Second, from 20 to 400 hypothetical proteins were assigned with functions in each of the 10 strains based on the homology search. Among newly assigned functions, 397 are so detailed to have definite gene names. Third, 106 genes missed by the original annotations were picked up by an ab initio gene finder combined with similarity alignment. Transcriptional experiments validated the effectiveness of this method in Laribacter hongkongensis and Chromobacterium violaceum. Among the 106 newly found genes, some deserve particular interests. For example, 27 transposases were newly found in Neiserria meningitidis alpha14. In Neiserria gonorrhoeae NCCP11945, four new genes with putative functions and definite names (nusG, rpsN, rpmD and infA) were found and homologues of them usually are essential for survival in bacteria. The updated annotations for the 10 Neisseriaceae genomes provide a more accurate prediction of protein-coding genes and a more detailed functional information of hypothetical proteins. It will benefit research into the lifestyle, metabolism, environmental adaption and pathogenicity of the Neisseriaceae species. The re-annotation procedure could be used directly, or after the adaption of detailed methods, for checking annotations of any other bacterial or archaeal genomes.
机译:在本文中,我们通过结合基于组成和相似性的方法,在奈瑟菌科的10个完整细菌基因组中,通过系统的方法对蛋白质编码基因进行了全面的重新注释。首先,使用基于组成的方法预测了418个假设基因为非编码基因,并且从基因列表中删除了413个假设基因。散点图和直系同源群(COG)分数分析均支持该结果。其次,基于同源性搜索,在10个菌株中的每一个中,从20到400个假设的蛋白质具有功能。在新分配的功能中,397如此详细,以具有确定的基因名称。第三,通过从头算基因查找器结合相似性比对,选择了原始注释遗漏的106个基因。转录实验验证了该方法在香港漆器和紫杆菌中的有效性。在106个新发现的基因中,有些值得特别关注。例如,在脑膜炎奈瑟氏球菌α14中新发现了27种转座酶。在淋病奈瑟氏球菌NCCP11945中,发现了四个具有推定功能和明确名称的新基因(nusG,rpsN,rpmD和infA),它们的同源物通常对于细菌的生存至关重要。更新的10个奈瑟菌科基因组注释可更准确地预测蛋白质编码基因,并提供更详细的假设蛋白质功能信息。这将有助于研究奈瑟菌科物种的生活方式,新陈代谢,环境适应性和致病性。重新注释程序可以直接使用,也可以在详细方法改编后用于检查任何其他细菌或古细菌基因组的注释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号