【24h】

Fast Walking Tree Method via Recurrence Reduction for Biological String Alignment

机译:通过递归减少的快速走树方法进行生物字符串对齐

获取原文
获取原文并翻译 | 示例

摘要

The meaning of biological sequences is a central problem of modern biology. Although string matching is well-understood in the edit-distance model, biological strings with translocations and inversions violate this model's assumptions. To align biologically reasonable strings, we proposed the Walking Tree Method, an approximate string alignment method that can handle insertion, deletions, substitutions, translocations, and more than one level of inversions. Our earlier versions were able to align whole bacterial genomes (~1 Mbps) to discover and verify genes. As extremely long sequences can now be deciphered rapidly and accurately without amplification, speeding up the method becomes necessary. Via a technique that we call it "recurrence reduction" in which duplicate computations can be looked up rather than re-computed, we are able to futher reduce the run time from quadratic to sub-quadratic.
机译:生物序列的含义是现代生物学的中心问题。尽管在编辑距离模型中字符串匹配是很容易理解的,但是具有易位和倒置的生物字符串违反了该模型的假设。为了对齐生物学上合理的字符串,我们提出了“步行树方法”,这是一种近似的字符串对齐方法,可以处理插入,删除,替换,易位以及一个以上的反转级别。我们的早期版本能够对齐整个细菌基因组(约1 Mbps)以发现和验证基因。由于现在无需扩增即可快速,准确地破译非常长的序列,因此有必要加快该方法。通过一种我们称为“递归减少”的技术,在该技术中可以查找重复计算而不是重新计算,我们可以进一步将运行时间从二次减少到次二次。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号