...
首页> 外文期刊>Molecular phylogenetics and evolution >Fast and accurate genome comparison using genome images: The Extended Natural Vector Method
【24h】

Fast and accurate genome comparison using genome images: The Extended Natural Vector Method

机译:使用基因组的快速和准确的基因组比较:扩展的自然载体方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Using numerical methods for genome comparison has always been of importance in bioinformatics. The Chaos Game Representation (CGR) is an effective genome sequence mapping technology, which converts genome sequences to CGR images. To each CGR image, we associate a vector called an Extended Natural Vector (ENV). The ENV is based on the distribution of intensity values. This mapping produces a one-to-one correspondence between CGR images and their ENVs. We define the distance between two DNA sequences as the distance between their associated ENVs. We cluster and classify several datasets including Influenza A viruses, Bacillus genomes, and Conoidea mitochondrial genomes to build their phylogenetic trees. Results show that our ENV combining CGR method (CGR-ENV) compares favorably in classification accuracy and efficiency against the multiple sequence alignment (MSA) method and other alignment-free methods. The research provides significant insights into the study of phylogeny, evolution, and efficient DNA comparison algorithms for large genomes.
机译:利用基因组比较的数值方法始终是生物信息学的重要性。混沌游戏表示(CGR)是一种有效的基因组序列映射技术,将基因组序列转化为CGR图像。对于每个CGR图像,我们将称为扩展自然载体(ENV)的向量相关联。 ENV基于强度值的分布。该映射产生CGR图像与其envs之间的一对一对应关系。我们将两个DNA序列之间的距离定义为其相关环境之间的距离。我们群集并分类包括流感病毒,杆菌基因组和基体线粒体基因组的多个数据集以构建它们的系统发育树。结果表明,我们的ENV组合CGR方法(CGR-ENV)在分类准确度和效率上对多序列对准(MSA)方法和其他无比目方法进行了比较。该研究对大型基因组的系统发生,演化和高效DNA比较算法提供了重要的见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号