Journal: Informatics in Medicine Unlocked

Stretch Profile: A pruning technique to accelerate DNA sequence search

Abstract

DNA sequence similarity search is widely used by scientists to facilitate biological research. As more sequences are added over the years, sequence databases keep growing. Existing techniques usually focus on improving the search algorithms themselves to make the search faster, ignoring the possibility of excluding unrelated sequences from the search. This paper presents a pruning technique to accelerate DNA sequence search, based on a novel Stretch Profile created from stretches of consecutive identical base characters: A-Stretch, C-Stretch, G-Stretch, and T-Stretch. The Stretch Profile is pre-generated for each sequence in a sequence database. During the search, the Stretch Profile of the query sequence is generated for comparison. Database sequences whose profiles do not match the Stretch Profile of the query sequence are excluded from the search, which reduces the search space and, consequently, the search time. For evaluation, we compare the sequences retrieved from the Greengenes database, and the processing time, when using BLAST alone and when using the proposed pruning technique with BLAST. The results show that the proposed pruning technique reduces the search time by 30.43% to 63.74%, depending on the length of the input query, while maintaining a sensitivity of 1.00 relative to the results of the original BLAST search.
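
The abstract does not say what a Stretch Profile stores or how two profiles are compared, so the following Python sketch only illustrates the general idea of profile-based pruning under assumptions of our own. It assumes a profile records the longest run of each base, and that a database sequence "matches" when each of its longest runs is at least as long as the query's. The names `stretch_profile`, `may_match`, `prune`, and the `slack` parameter are hypothetical and are not taken from the paper.

```python
from itertools import groupby

BASES = "ACGT"


def stretch_profile(seq):
    """Return the longest run length of each base (ASSUMED profile definition)."""
    profile = {base: 0 for base in BASES}
    for base, run in groupby(seq.upper()):
        if base in profile:  # skip ambiguity codes such as N
            profile[base] = max(profile[base], sum(1 for _ in run))
    return profile


def may_match(db_profile, query_profile, slack=0):
    """ASSUMED match rule: every longest run in the database sequence is at
    least as long as the query's, minus an optional tolerance."""
    return all(db_profile[b] + slack >= query_profile[b] for b in BASES)


def prune(db_profiles, query, slack=0):
    """Return the names of database sequences that survive pruning."""
    query_profile = stretch_profile(query)
    return [
        name
        for name, profile in db_profiles.items()
        if may_match(profile, query_profile, slack)
    ]


# Profiles are built once, ahead of any search.
sequences = {"s1": "ACGGGTTA", "s2": "ACGTACGT", "s3": "AAACGGGGT"}
db_profiles = {name: stretch_profile(seq) for name, seq in sequences.items()}

# Only the survivors would be handed to the real search tool (e.g. BLAST).
candidates = prune(db_profiles, "CGGGT")
# s2 is pruned: its longest G run (1) is shorter than the query's (3).
```

Under this assumed rule, a sequence that contains the query as an exact substring is never pruned, because its longest runs cannot be shorter than the query's. Approximate matches can break that guarantee, which is why the sketch includes the `slack` tolerance. How the paper itself keeps sensitivity at 1.00 is not described in the abstract.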
