首页> 外文期刊>Nucleic Acids Research >PSAR: measuring multiple sequence alignment reliability by probabilistic sampling
【24h】

PSAR: measuring multiple sequence alignment reliability by probabilistic sampling

机译:PSAR:通过概率抽样测量多序列比对可靠性

获取原文
获取原文并翻译 | 示例
           

摘要

Multiple sequence alignment, which is of fundamental importance for comparative genomics, is a difficult problem and error-prone. Therefore, it is essential to measure the reliability of the alignments and incorporate it into downstream analyses. We propose a new probabilistic sampling-based alignment reliability (PSAR) score. Instead of relying on heuristic assumptions, such as the correlation between alignment quality and guide tree uncertainty in progressive alignment methods, we directly generate suboptimal alignments from an input multiple sequence alignment by a probabilistic sampling method, and compute the agreement of the input alignment with the suboptimal alignments as the alignment reliability score. We construct the suboptimal alignments by an approximate method that is based on pairwise comparisons between each single sequence and the sub-alignment of the input alignment where the chosen sequence is left out. By using simulation-based benchmarks, we find that our approach is superior to existing ones, supporting that the suboptimal alignments are highly informative source for assessing alignment reliability. We apply the PSAR method to the alignments in the UCSC Genome Browser to measure the reliability of alignments in different types of regions, such as coding exons and conserved non-coding regions, and use it to guide cross-species conservation study.
机译:对比较基因组学至关重要的多序列比对是一个难题并且容易出错。因此,必须测量比对的可靠性并将其纳入下游分析中。我们提出了一个新的基于概率抽样的对准可靠性(PSAR)评分。无需依赖启发式假设(例如渐进比对方法中比对质量与指导树不确定性之间的相关性),我们可以通过概率抽样方法直接从输入的多序列比对中生成次优比对,并计算输入比对与次优比对作为比对可靠性得分。我们通过一种近似方法来构建次优比对,该近似方法基于每个单个序列与输入比对的子比对之间的成对比较,其中所选序列被省略。通过使用基于仿真的基准,我们发现我们的方法优于现有的基准,支持次优比对是评估比对可靠性的高信息来源。我们将PSAR方法应用于UCSC基因组浏览器中的比对,以测量不同类型区域(例如编码外显子和保守的非编码区域)中比对的可靠性,并用其指导跨物种保护研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号