首页> 外文期刊>PLoS Computational Biology >Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection
【24h】

Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection

机译:人工基因组在非典型基因检测评估方法中的应用

获取原文
           

摘要

Parametric methods for identifying laterally transferred genes exploit the directional mutational biases unique to each genome. Yet the development of new, more robust methods—as well as the evaluation and proper implementation of existing methods—relies on an arbitrary assessment of performance using real genomes, where the evolutionary histories of genes are not known. We have used the framework of a generalized hidden Markov model to create artificial genomes modeled after genuine genomes. To model a genome, “core” genes—those displaying patterns of mutational biases shared among large numbers of genes—are identified by a novel gene clustering approach based on the Akaike information criterion. Gene models derived from multiple “core” gene clusters are used to generate an artificial genome that models the properties of a genuine genome. Chimeric artificial genomes—representing those having experienced lateral gene transfer—were created by combining genes from multiple artificial genomes, and the performance of the parametric methods for identifying “atypical” genes was assessed directly. We found that a hidden Markov model that included multiple gene models, each trained on sets of genes representing the range of genotypic variability within a genome, could produce artificial genomes that mimicked the properties of genuine genomes. Moreover, different methods for detecting foreign genes performed differently—i.e., they had different sets of strengths and weaknesses—when identifying atypical genes within chimeric artificial genomes.
机译:用于鉴定横向转移基因的参数方法利用了每个基因组特有的定向突变偏向。然而,新的,更强大的方法的开发以及对现有方法的评估和适当实施,都依赖于使用真实基因组的性能的任意评估,而基因的进化历史尚不清楚。我们使用了广义隐马尔可夫模型的框架来创建以真实基因组为模型的人工基因组。为了对基因组建模,通过基于Akaike信息标准的新型基因聚类方法,识别了“核心”基因(显示大量基因之间共享的突变偏倚模式)。来自多个“核心”基因簇的基因模型用于生成模拟真正基因组特性的人工基因组。通过组合来自多个人工基因组的基因,创建了嵌合人工基因组(代表经历过横向基因转移的那些基因组),并直接评估了用于鉴定“非典型”基因的参数方法的性能。我们发现一个包含多个基因模型的隐马尔可夫模型,每个模型都对代表基因组内基因型变异范围的基因集进行训练,可以产生模仿真正基因组特性的人工基因组。此外,在鉴定嵌合人工基因组中的非典型基因时,用于检测外源基因的不同方法表现不同,即它们具有不同的优缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号