首页> 美国卫生研究院文献>Nucleic Acids Research >Detecting laterally transferred genes: use of entropic clustering methods and genome position
【2h】

Detecting laterally transferred genes: use of entropic clustering methods and genome position

机译:检测横向转移的基因:使用熵聚类方法和基因组位置

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Most parametric methods for detecting foreign genes in bacterial genomes use a scoring function that measures the atypicality of a gene with respect to the bulk of the genome. Genes whose features are sufficiently atypical—lying beyond a threshold value—are deemed foreign. Yet these methods fail when the range of features of donor genomes overlaps with that of the recipient genome, leading to misclassification of foreign and native genes; existing parametric methods choose threshold parameters to balance these error rates. To circumvent this problem, we have developed a two-pronged approach to minimize the misclassification of genes. First, beyond classifying genes as merely atypical, a gene clustering method based on Jensen–Shannon entropic divergence identifies classes of foreign genes that are also similar to each other. Second, genome position is used to reassign genes among classes whose composition features overlap. This process minimizes the misclassification of either native or foreign genes that are weakly atypical. The performance of this approach was assessed using artificial chimeric genomes and then applied to the well-characterized Escherichia coli K12 genome. Not only were foreign genes identified with a high degree of accuracy, but genes originating from the same donor organism were effectively grouped.
机译:用于检测细菌基因组中外源基因的大多数参数方法都使用一种计分功能,该功能可测量基因相对于基因组大部分的非典型性。具有非典型特征(超过阈值)的基因被认为是外来的。然而,当供体基因组的特征范围与受体基因组的特征范围重叠时,这些方法将失败,从而导致外源基因和天然基因的错误分类。现有的参数方法选择阈值参数来平衡这些错误率。为了避免这个问题,我们开发了一种两管齐下的方法来最大程度地减少基因的错误分类。首先,除了将基因分类为非典型的以外,基于詹森-香农熵散度的基因聚类方法还可以识别彼此相似的外源基因类别。其次,使用基因组位置在组成特征重叠的类别之间重新分配基因。此过程可最大程度地减少对非典型基因的本地或外来基因的错误分类。使用人工嵌合基因组评估了该方法的性能,然后将其应用于特征明确的大肠杆菌K12基因组。不仅可以高度准确地鉴定外源基因,而且可以有效地对源自同一供体生物的基因进行分组。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号