首页> 外文期刊>Molecular biology and evolution >TCS: A New Multiple Sequence Alignment Reliability Measure to Estimate Alignment Accuracy and Improve Phylogenetic Tree Reconstruction
【24h】

TCS: A New Multiple Sequence Alignment Reliability Measure to Estimate Alignment Accuracy and Improve Phylogenetic Tree Reconstruction

机译:TCS:一种新的多序列比对可靠性测量,用于估计比对准确性并改善系统发育树重建

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Multiple sequence alignment (MSA) is a key modeling procedure when analyzing biological sequences. Homology and evolutionary modeling are the most common applications of MSAs. Both are known to be sensitive to the underlying MSA accuracy. In this work, we show how this problem can be partly overcome using the transitive consistency score (TCS), an extended version of the T-Coffee scoring scheme. Using this local evaluation function, we show that one can identify the most reliable portions of an MSA, as judged from BAliBASE and PREFAB structure-based reference alignments. We also show how this measure can be used to improve phylogenetic tree reconstruction using both an established simulated data set and a novel empirical yeast data set. For this purpose, we describe a novel lossless alternative to site filtering that involves overweighting the trustworthy columns. Our approach relies on the T-Coffee framework; it uses libraries of pairwise alignments to evaluate any third party MSA. Pairwise projections can be produced using fast or slow methods, thus allowing a trade-off between speed and accuracy. We compared TCS with Heads-or-Tails, GUIDANCE, Gblocks, and trimAl and found it to lead to significantly better estimates of structural accuracy and more accurate phylogenetic trees. The software is available from www.tcoffee.org/Projects/tcs.
机译:多序列比对 (MSA) 是分析生物序列时的关键建模过程。同源性和进化建模是 MSA 最常见的应用。众所周知,两者都对基础 MSA 准确性敏感。在这项工作中,我们展示了如何使用传递一致性评分 (TCS) 部分克服这个问题,TCS 是 T-Coffee 评分方案的扩展版本。使用这种局部评估功能,我们表明,人们可以识别 MSA 中最可靠的部分,这是从 BAliBASE 和 PREFAB 基于结构的参考比对中判断的。我们还展示了如何使用该措施来改善系统发育树的重建,使用已建立的模拟数据集和新的经验酵母数据集。为此,我们描述了一种新的无损站点过滤替代方案,该替代方案涉及超重可信列。我们的方法依赖于T-Coffee框架;它使用成对对齐的库来评估任何第三方 MSA。可以使用快速或慢速方法进行成对投影,从而允许在速度和精度之间进行权衡。我们将 TCS 与 Heads-or-Tails、GUIDANCE、Gblocks 和 trimAl 进行了比较,发现它能显着更好地估计结构准确性和更准确的系统发育树。该软件可从 www.tcoffee.org/Projects/tcs 获得。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号