首页> 外文期刊>Nucleic Acids Research >BALSA: Bayesian algorithm for local sequence alignment
【24h】

BALSA: Bayesian algorithm for local sequence alignment

机译:BALSA:用于局部序列比对的贝叶斯算法

获取原文
获取原文并翻译 | 示例
           

摘要

The Smith-Waterman algorithm yields a single alignment, which, albeit optimal, can be strongly affected by the choice of the scoring matrix and the gap penalties. Additionally, the scores obtained are dependent upon the lengths of the aligned sequences, requiring a post-analysis conversion. To overcome some of these shortcomings, we developed a Bayesian algorithm for local sequence alignment (BALSA), that takes into account the uncertainty associated with all unknown variables by incorporating in its forward sums a series of scoring matrices, gap parameters and all possible alignments. The algorithm can return both the joint and the marginal optimal alignments, samples of alignments drawn from the posterior distribution and the posterior probabilities of gap penalties and scoring matrices. Furthermore, it automatically adjusts for variations in sequence lengths. BALSA was compared with SSEARCH, to date the best performing dynamic programming algorithm in the detection of structural neighbors. Using the SCOP databases PDB40D-B and PDB90D-B, BALSA detected 19.8 and 41.3% of remote homologs whereas SSEARCH detected 18.4 and 38% at an error rate of 1% errors per query over the databases, respectively.
机译:Smith-Waterman算法产生一个单一的比对,尽管它是最佳的,但会受到评分矩阵和空位罚分的选择的强烈影响。此外,获得的分数取决于比对序列的长度,需要进行分析后转换。为了克服这些缺点中的某些,我们开发了一种用于局部序列比对的贝叶斯算法(BALSA),该算法考虑了与所有未知变量相关的不确定性,方法是将一系列计分矩阵,间隙参数和所有可能的比对合并到其前向总和中。该算法可以返回联合和边际最优比对,从后验分布以及空位罚分和得分矩阵的后验概率得出的比对样本。此外,它会自动调整序列长度的变化。迄今为止,BALSA与SSEARCH进行了比较,迄今为止在结构邻居检测中性能最佳的动态规划算法。使用SCOP数据库PDB40D-B和PDB90D-B,BALSA分别检测到19.8%和41.3%的远程同系物,而SSEARCH在数据库中每个查询的错误率分别为1%和1%,分别检测到18.4和38%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号