首页> 外文期刊>Nucleic acids research >A sensitive, support-vector-machine method for the detection of horizontal gene transfers in viral, archaeal and bacterial genomes
【24h】

A sensitive, support-vector-machine method for the detection of horizontal gene transfers in viral, archaeal and bacterial genomes

机译:一种灵敏的支持向量机方法,用于检测病毒,古细菌和细菌基因组中的水平基因转移

获取原文
           

摘要

In earlier work, we introduced and discussed a generalized computational framework for identifying horizontal transfers. This framework relied on a gene's nucleotide composition, obviated the need for knowledge of codon boundaries and database searches, and was shown to perform very well across a wide range of archaeal and bacterial genomes when compared with previously published approaches, such as Codon Adaptation Index and C + G content. Nonetheless, two considerations remained outstanding: we wanted to further increase the sensitivity of detecting horizontal transfers and also to be able to apply the method to increasingly smaller genomes. In the discussion that follows, we present such a method, Wn-SVM, and show that it exhibits a very significant improvement in sensitivity compared with earlier approaches. Wn-SVM uses a one-class support-vector machine and can learn using rather small training sets. This property makes Wn-SVM particularly suitable for studying small-size genomes, similar to those of viruses, as well as the typically larger archaeal and bacterial genomes. We show experimentally that the new method results in a superior performance across a wide range of organisms and that it improves even upon our own earlier method by an average of 10% across all examined genomes. As a small-genome case study, we analyze the genome of the human cytomegalovirus and demonstrate that Wn-SVM correctly identifies regions that are known to be conserved and prototypical of all beta-herpesvirinae, regions that are known to have been acquired horizontally from the human host and, finally, regions that had not up to now been suspected to be horizontally transferred. Atypical region predictions for many eukaryotic viruses, including the α-, β- and γ-herpesvirinae, and 123 archaeal and bacterial genomes, have been made available online at http://cbcsrv.watson.ibm.com/HGT_SVM/.
机译:在较早的工作中,我们介绍并讨论了用于确定水平转移的通用计算框架。该框架依赖于基因的核苷酸组成,消除了对密码子边界和数据库搜索的了解,并且与以前公布的方法(如密码子适应指数和密码子)相比,在各种古细菌和细菌基因组中表现良好。 C + G含量。尽管如此,两个考虑因素仍然悬而未决:我们希望进一步提高检测水平转移的灵敏度,并希望将该方法应用于越来越小的基因组。在随后的讨论中,我们介绍了Wn-SVM这样的方法,并表明与早期方法相比,它在灵敏度方面表现出非常显着的提高。 Wn-SVM使用一类支持向量机,并且可以使用相当小的训练集进行学习。这种特性使Wn-SVM特别适合研究与病毒相似的小型基因组,以及通常较大的古细菌和细菌基因组。我们通过实验表明,该新方法在各种生物体中均具有优异的性能,即使在我们自己的较早方法中,该方法在所有检查的基因组中也平均提高了10%。作为一个小基因组案例研究,我们分析了人类巨细胞病毒的基因组,并证明Wn-SVM可以正确识别所有β-疱疹病毒的已知保守区域和原型区域,而已知这些区域均已从该区域水平获得。人类宿主,最后怀疑到目前为止尚未被横向转移的地区。可从http://cbcsrv.watson.ibm.com/HGT_SVM/在线获取许多真核病毒的非典型区域预测,包括α-,β-和γ-疱疹病毒以及123个古细菌和细菌基因组。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号