首页> 外文会议>Field-Programmable Logic and Applications >Processing Repetitive Sequence Structures with Mismatches at Streaming Rate
【24h】

Processing Repetitive Sequence Structures with Mismatches at Streaming Rate

机译:以流速率处理不匹配的重复序列结构

获取原文

摘要

With the accelerating growth of biological databases and the beginning of genome-scale processing, cost-effective high-performance se-quence analysis remains an essential problem in bioinformatics. We ex-amine the use of FPGAs for finding repetitive structures such as tandem repeats and palindromes under various mismatch models. For all prob-lems addressed here, we process strings in streaming mode and obtain processing times of 5ns per character for arbitrary length strings. Using a Xilinx XC2VP100, we can find: (ⅰ) all repeats up to size 1024, each with any number of mismatches; (ⅱ) all precise tandem arrays with repeats up to size 1024; (ⅲ) all palindromes up to size 256, each with any number of mismatches, or (ⅳ) a somewhat smaller size of (ⅰ) and (ⅲ) with a single insertion or deletion. The speed-up factors range from 250 to 6000 over an efficient serial implementation which is itself many times faster than a direct implementation of a theoretically optimal serial algorithm.
机译:随着生物数据库的加速增长和基因组规模处理的开始,具有成本效益的高性能序列分析仍然是生物信息学中的重要问题。我们研究了在各种失配模型下使用FPGA查找重复结构(例如串联重复序列和回文序列)的情况。对于此处解决的所有问题,我们以流模式处理字符串,对于任意长度的字符串,每个字符获得5ns的处理时间。使用Xilinx XC2VP100,我们可以发现:(ⅰ)所有重复最大为1024,每个都有不匹配的任何数目; (ⅱ)重复数最大为1024的所有精确串联阵列; (ⅲ)大小不超过256的所有回文,每个错配有许多不匹配,或者(ⅳ)较小的(ⅰ)和(ⅲ)大小,且一次插入或删除。在有效的串行实现中,加速因子的范围从250到6000,其本身比在理论上最佳的串行算法的直接实现要快许多倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号