Processing Repetitive Sequence Structures with Mismatches at Streaming Rate

Abstract

With the accelerating growth of biological databases and the beginning of genome-scale processing, cost-effective high-performance sequence analysis remains an essential problem in bioinformatics. We examine the use of FPGAs for finding repetitive structures such as tandem repeats and palindromes under various mismatch models. For all problems addressed here, we process strings in streaming mode and obtain processing times of 5 ns per character for arbitrary-length strings. Using a Xilinx XC2VP100, we can find: (i) all repeats up to size 1024, each with any number of mismatches; (ii) all precise tandem arrays with repeats up to size 1024; (iii) all palindromes up to size 256, each with any number of mismatches; or (iv) a somewhat smaller size of (i) and (iii) with a single insertion or deletion. The speed-up factors range from 250 to 6000 over an efficient serial implementation which is itself many times faster than a direct implementation of a theoretically optimal serial algorithm.
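To make the streaming formulation concrete, the following is a minimal software sketch of one of the problems named in the abstract: counting mismatches between adjacent repeat blocks as characters arrive one at a time. It is not the authors' FPGA design, which the abstract does not describe; the function name `tandem_mismatch_stream` and its parameters are hypothetical, and the sliding-window counter is an assumed, straightforward way to do this serially.

```python
from collections import deque


def tandem_mismatch_stream(chars, max_period):
    """Yield (end_index, period, mismatches) for each pair of adjacent
    length-`period` blocks ending at end_index, one character at a time.

    Illustrative serial sketch only; names and structure are assumptions.
    """
    # Keep just enough history to compare s[i] with s[i - 2 * max_period].
    buf = deque(maxlen=2 * max_period + 1)
    # counts[p] holds the mismatches among the last p comparisons s[j] vs s[j - p].
    counts = [0] * (max_period + 1)
    for i, c in enumerate(chars):
        buf.append(c)
        n = len(buf)  # buf[n - 1] is s[i]
        for p in range(1, max_period + 1):
            if i >= p:
                # The comparison (s[i], s[i - p]) enters the window.
                counts[p] += c != buf[n - 1 - p]
            if i >= 2 * p:
                # The comparison (s[i - p], s[i - 2p]) leaves the window.
                counts[p] -= buf[n - 1 - p] != buf[n - 1 - 2 * p]
            if i >= 2 * p - 1:
                # Two full blocks of length p are now available.
                yield i, p, counts[p]


if __name__ == "__main__":
    for end, period, mism in tandem_mismatch_stream("ACGTACGA", 4):
        if mism <= 1 and period == 4:
            print(end, period, mism)
```

Each incoming character costs one update per candidate period, so this serial version does work proportional to `max_period` per character. In hardware those per-period updates are independent of one another, which is the kind of work that could in principle be done in parallel.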

Bibliographic details

Source
Field-Programmable Logic and Applications, 2004, pp. 1080-1083 (4 pages)
Authors
Albert A. Conti; Tom Van Court; Martin C. Herbordt
Original format: PDF
CLC classification: Programming, software engineering

Similar literature

1. Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures [J]. Steve Hoffmann, Christian Otto, Stefan Kurtz. PLoS Computational Biology, 2009, No. 9
2. The contrasting structures of mismatched DNA sequences containing looped-out bases (bulges) and multiple mismatches (bubbles) [J]. Anamitra Bhattacharyya, David M.J. Lilley. Nucleic Acids Research, 1989, No. 17
3. Dot Plot Detects Repetitive Structures in DNA Sequences [C]. Akito Taneda, Toshio Shimizu. Workshop on Genome Informatics, 2002
4. Sequence dependence of stabilities and structures of tandem mismatches and Watson-Crick base pairs in RNA [D]. Xia, Tianbing. 1999
5. Structure of two human beta-actin-related processed genes one of which is located next to a simple repetitive sequence [O]. M. Moos, D. Gallwitz. 1983
6. Stem-loop structures of the repetitive DNA sequences located at human centromeres [R]. Gupta, G., Garcia, A. E., Ratliff, R. 1993
