首页> 外文会议>2010 IEEE/ACS International Conference on Computer Systems and Applications >An optimized filter for finding multiple repeats in DNA sequences
【24h】

An optimized filter for finding multiple repeats in DNA sequences

机译:一种用于在DNA序列中发现多个重复序列的优化过滤器

获取原文
获取外文期刊封面目录资料

摘要

This paper presents new optimizations designed to improve an algorithm at the state-of-the-art for filtering sequences as a preprocessing step to the task of finding multiple repeats allowing a given pairwise edit distance between pairs of occurrences. The target application is to find possibly long repeats having two or more occurrences, such that each couple of occurrences may show substitutions, insertions or deletions in up to 10 to 15 % of their size. Assimilated to multiple alignment, exact detection of multiple repeats is an NP-hard problem. For increasing computation speed while avoiding the use of heuristics, one may use filters that quickly remove large parts of input that do not contain searched repeats. We describe at theoretical level some optimizations that can be applied to the tool that is currently the state-of-the-art for this filtering task. Finally, we exhibit some experiments in which the optimized tool outperforms its original version.
机译:本文提出了一些新的优化方法,这些优化方法旨在改进现有技术中用于过滤序列的算法,以此作为查找多个重复的任务的预处理步骤,从而允许出现的几对之间具有给定的成对编辑距离。目标应用是寻找可能出现两次或两次以上重复的长重复序列,以使每对重复出现的取代,插入或缺失的大小最多占其大小的10%至15%。与多重比对相比,精确检测多个重复序列是一个NP难题。为了提高计算速度,同时又避免使用试探法,可以使用过滤器来快速删除不包含搜索重复项的大部分输入。我们在理论层面上描述了可以应用于该工具的一些优化,这是当前用于此过滤任务的最新技术。最后,我们展示了一些实验,在这些实验中,优化工具的性能优于其原始版本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号