Conference: Asia-Pacific Bioinformatics Conference

Simultaneous phylogeny reconstruction and multiple sequence alignment


Abstract

Background: A phylogeny is the evolutionary history of a group of organisms. Sequence data remain the most widely used data type for phylogenetic reconstruction. Before any sequences can be used for phylogeny reconstruction, they must be aligned, and the quality of the multiple sequence alignment has been shown to affect the quality of the inferred phylogeny. At the same time, all current multiple sequence alignment programs use a guide tree to produce the alignment, and experiments have shown that good guide trees can significantly improve the quality of the multiple alignment.

Results: We devise a new algorithm that simultaneously aligns multiple sequences and searches for the phylogenetic tree that leads to the best alignment. We also implemented the algorithm as a C program package, which can handle both DNA and protein data and can use either a simple cost model or complex substitution matrices such as PAM250 or BLOSUM62. The performance of the new method is compared with that of other popular multiple sequence alignment tools, including widely used programs such as ClustalW and T-Coffee. Experimental results suggest that this method performs well in terms of both phylogeny accuracy and alignment quality.

Conclusion: We present an algorithm that aligns multiple sequences and reconstructs the phylogenies that minimize the alignment score. It is based on an efficient algorithm for solving the median problem for three sequences. Our extensive experiments suggest that this method is very promising and can produce high-quality phylogenies and alignments.
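The median problem for three sequences mentioned in the conclusion can be illustrated with a small sketch. The abstract does not describe the authors' efficient algorithm, so the code below is not their method. It is the textbook cubic-time dynamic program, under an assumed simple cost model (unit mismatch and unit indel cost). The names `column_cost` and `median_cost` and the parameters `mismatch` and `indel` are hypothetical and chosen only for this illustration.

```python
from itertools import product

GAP = "-"


def column_cost(column, mismatch=1, indel=1):
    """Cheapest cost of one alignment column of three symbols.

    Tries every symbol in the column, plus the gap, as the median
    symbol and returns the smallest total cost to the three symbols.
    """
    best = None
    for median in set(column) | {GAP}:
        cost = 0
        for symbol in column:
            if symbol == median:
                continue
            # Pairing a residue with a gap is an indel, otherwise a mismatch.
            cost += indel if GAP in (symbol, median) else mismatch
        if best is None or cost < best:
            best = cost
    return best


def median_cost(a, b, c, mismatch=1, indel=1):
    """Minimum median alignment score of three sequences (assumed cost model).

    D[i][j][k] holds the best score for the prefixes a[:i], b[:j], c[:k].
    Each step consumes a residue from at least one of the three sequences.
    """
    la, lb, lc = len(a), len(b), len(c)
    inf = float("inf")
    D = [[[inf] * (lc + 1) for _ in range(lb + 1)] for _ in range(la + 1)]
    D[0][0][0] = 0
    # The seven non-empty ways to advance in the three sequences.
    moves = [m for m in product((0, 1), repeat=3) if any(m)]
    for i in range(la + 1):
        for j in range(lb + 1):
            for k in range(lc + 1):
                for di, dj, dk in moves:
                    pi, pj, pk = i - di, j - dj, k - dk
                    if pi < 0 or pj < 0 or pk < 0:
                        continue
                    column = (
                        a[pi] if di else GAP,
                        b[pj] if dj else GAP,
                        c[pk] if dk else GAP,
                    )
                    candidate = D[pi][pj][pk] + column_cost(column, mismatch, indel)
                    if candidate < D[i][j][k]:
                        D[i][j][k] = candidate
    return D[la][lb][lc]
```

Calling `median_cost("ACGT", "ACGT", "AGGT")` scores the three sequences against their best median under these assumed costs. Time and memory grow with the product of the three sequence lengths, so this sketch is practical only for short inputs. Substitution matrices such as PAM250 or BLOSUM62 would replace the unit costs in `column_cost`.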
