首页> 外文会议>International Conference on Bioinformatics and Computational Biology >Long spaced seeds for finding similarities between biological sequences
【24h】

Long spaced seeds for finding similarities between biological sequences

机译:用于在生物序列之间寻找相似性的长间隔的种子

获取原文

摘要

Homology search finds similar segments between two biological sequences, such as DNA or protein sequences. A significant fraction of the computing power in the world is devoted to finding similarities between biological sequences. The introduction of optimal spaced seeds in [Ma et al., Bioinformatics 18 (2002) 440-445] has increased both the sensitivity and the speed of homology search and it has been adopted by many alignment programs such as BLAST. In spite of significant amount of work, there are no algorithms able to compute long good seeds. We present a different approach here by introducing a new measure that has two desired properties: (i) it is highly correlated with sensitivity of spaced seeds and (ii) it is easily computable. Using this measure we give algorithms that compute better seeds than all previous ones. The fact that sensitivity is not required is essential as it enables us to compute very long good seeds, far beyond the size for which sensitivity can be computed.
机译:同源性搜索在两个生物序列之间找到类似的段,例如DNA或蛋白质序列。致力于在生物学序列之间的相似之处致力于在世界上进行大量的计算能力。 [MA等人,生物信息学18(2002)440-445]中的最佳间隔种子的引入增加了同源性搜索的灵敏度和速度,并且许多对准程序(如BLAST)采用。尽管有大量的工作,但没有能够计算长种种子的算法。我们通过引入具有两个所需特性的新措施来提出一种不同的方法:(i)它与间隔种子的灵敏度高,(ii)很容易计算。使用这种措施,我们提供计算比以前所有的种子更好的种子的算法。不需要敏感性的事实是必不可少的,因为它使我们能够计算非常长的好的种子,远远超出可以计算灵敏度的大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号